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Abstract 

Various concepts of mean shape previously unrelated in the litera- 
ture are brought into relation. In particular for non-manifolds such as 
Kendall's 3D shape space, this paper answers the question, for which 
means one may apply a two-sample test. The answer is positive if intrin- 
sic or Ziezold means are used. The underlying general result of manifold 
stability of a mean on a shape space, the quotient due to an isometric 
action of a compact Lie group on a Riemannian manifold, blends the Slice 
Theorem from differential geometry with the statistics of shape. For 3D 
Procrustes means, however, a counterexample is given. To further elu- 
cidate on subtleties of means, for spheres and Kendall's shape spaces, a 
first order relationship between intrinsic, residual/Procrustean and ex- 
trinsic/ Ziezold means is derived stating that for high concentration the 
latter approximately divides the (generalized) geodesic segment between 
the former two by the ratio 1 : 3. This fact, consequences of coordi- 
nate choices for the power of tests and other details, e.g. that extrinsic 
Schoenberg means may increase dimension are discussed and illustrated 
by simulations and exemplary datasets. 
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1 Introduction 

The analysis of shape may be counted among the very early activities of mankind; 
be it for representation on cultural artefacts, or for morphological, biological and 
medical applications. In modern days shape analysis is gaining increased mo- 
mentum in computer visi on, image analys i s, biom edicine and many other fields. 
For a recent overview cf. iKrim and Yezzi (|2006h . 



A shape space can be viewed as the quotient of a Riemannian manifold 
- e.g. the pre-shape sphere of centered unit size landmark configurations - 
modulo the isometric and proper action of a Lie group (cf. iBredon dl972l)) 



conve ying shape equivalence - e.g. the group of rotations, cf. (jKendall et al 



19991 Chapter 11). Thus, it carries the canonical quotient structure of a union 



of manifold strata of different dimensions, which give in general a Riemannian 
manifold part - possibly with singularities comprising the non-manifold part of 
non -regular shapes at so me of which sectional c urvatures may tend to in finity, 
cf. (jKendall et all Il999l Chapter 7.3) as well as iHuckemann et al.l (|2010bh . 

In a Euclidean space, there is a clear and unique concept of a mean in terms 
of least squares minimization: the arithmetic average. Generalizing to mani- 
folds, however, the concept of expectation, average or mean is surprisingly non 
trivial and not at all canonical. In fact, it resulted in an overwhelming number 
of different concepts of means, each defined by a specific concept of a distance, 
all of which are identical for the Euclidean distance in a Euclidean space. More 
precisely, with every embedding in a Euclidean space come specific extrinsic and 
residual means and with every Riemannian structure comes a specific intrinsic 
mean. Furthermore, due to t he non-Euclide an geometry, local minimizers intro- 
duced as Marcher means bv lKendalll (Il990h may be different from global mini- 
mizers called Frechet means by ZiezoldT i 1977 ). and, neither ones are necessarily 
uni que. Nonethel e ss, carrying statistics over to manifolds , strong consistency 
(bv lZiezold ( 1977 ). Bhattacharva and Patrangenaru ( 2003 )1 and under suitable 
conditions, c entral limit theorems (CLTs) fo r such means have been derived (by 
IJupd ( 1988 ). Hendriks and Landsman! ( 19961 1998 ). Bhattacharva and Patrangenaru 
( 20051 ) as well as Huckemannl ( 2010al V). On shape spaces, various other c oncepts 
of means have been introduced, e .g. the famous Procrustes means (cf. IZiezold 
( 19771 ) iDrvden and Mardial (|l998l )). As we show here, these means are related 
to the above ones via a horizontal lifting from the bottom quotient to the top 
manifold, cf. Table [U In particular, since there are many - and often confusing 
- variants of Procrustes means in the literatur this paper introduces the ter- 
minology of Procrustean means standing for inheritance from residual means. 



manifold means 


shape means 


intrinsic 
extrinsic 
residual 


intrinsic 
Ziezold 
Procrustean 



Table 1: Three fundamental types of means on a shape space (right column) and 
their horizontal lifts to the respective manifold (left column). 

For a CLT to hold, a manifold structure locally relating to a Euclidean 
space is sufficient. This leads to the question under which conditions it can be 
guaranteed that a mean shape lies on the manifold part. 

Due to strong consistency, for a one-sample test for a specific mean shape 
on the manifold part, it may be assumed that sample means eventually lie on 
the manifold part as well, thus making the above cited CLTs available. To date 
however, a two- and a multi-sample test could not be justified because of a 
lacking result on the following manifold stability. 

Definition 1.1. A mean shape enjoys manifold stability if it is assumed on the 



2 



manifold part for any random shape assuming the manifold part with non-zero 
probability. 



A key result of this paper establishes manifold stability for intrinsic and 
Ziczold means under the following condition. 

Condition 1.2. On the non-manifold part the distribution of the random shape 
contains at most countably many point masses. 



Since the non- manifold part is a null-set (e.g. iBredonl (|1972|) ) under the 
projection of the Riemannian volume, this condition covers most realistic cases. 

We develop the corresponding theory for a general shape space quotient 
based on lifting a distribution on the shape space to the pre-shape space and 
subsequently exploiting the fact that intrinsic means are zeroes of an integral 
involving the Riemann exponential. The similar argument can be applied to 
Ziezold means but not to Procrustean means. More specifically, we develop 
the notion of a measurable horizontal lift of the shape space except for its quo- 
tient cut locus (introduced as well) to the pre-shape space. This requires the 
geometric concept of tubular neighborhoods admitting slices. 

Curiously, the result applied to the finite dimensional subspaces exhausting 
the quotient s hape space of clos e d pla nar curves with arbitrar y initial point 
introd uced by IZahn and Roskies (1972) and further studied by Klassen et al 



(|2004r ). gives that the shape of the circle, since it is a singularity, can never be 
an intrinsic shape mean of non-circular curves. 

As a second curiosity, 3D full Procrustes means do not enjoy manifold sta- 
bility in general, a counterexample involving low concentration is given. This is 
due to the fact that for low concentration, full Procrustes means may be 'blin- 
der' in comparison to intrinsic and Ziezold means to distributional changes far 
away from a mode. Included in this context is also a discussion of the Schoenberg 



me ans, recently introdu ced by iBandulasiri and Patrangenaru ( 2005 ) as well as 



by iDrvden et al. (2008) for the non-manifold Kendall reflection shape spaces, 



which in the ambient space, also allow for a CLT. Schoenberg means, as demon- 
strated, however, may feature 'blindness' in comparison to intrinsic and Ziezold 
means, with respect to changes in the distribution of nearly degenerate shapes. 
In a simulation we show that these features render Schoenberg means less effec- 
tive for a discrimination involving degenerate or nearly degenerate shapes. 

As a third curiosity, for spheres and Kendall's shape spaces, it is shown 
that, given uniqueness, with order of concentration, the (generalized) geodesic 
segment between the intrinsic mean and the residual/Procrustean mean is in 
approximation divided by the extrinsic/Ziezold mean by the ratio 1 : 3. This 
first order relationship can be readily observed in existing data sets. In partic- 
ular, this result supports the conjecture that Procrustean means of sufficiently 
concentrated distributions enjoy stability as well. 

This paper is structured as follows. For convenience of the reader, first in 
Section [2] Kendall's shape spaces are introduced along with the specific result 
on manifold stability, followed by a classification of concepts of means on general 
shape spaces in Section [3] The rather technical Section |4] develops horizontal 
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lifting and establishes manifold stability, technical proofs are deferred to the 
appendix. In Section [5] extrinsic Schoenberg means are discussed and Section [6] 
tackles local effects of curvature on spheres and Kendall's shape spaces. Section 
[7J illustrates practical consequences using classical data-sets as well as simula- 
tions. Note that lacking stability does not affect the validity of the Strong Law, 
on which the considerations on asymptotic distance in Sections H] and [7J are 
based. 

An R-package f or all of the computations performed is provided online: 
Huckemannl (|2010bl) . 



2 Stability of Means on Kendall's Shape Spaces 

In the statistical analysis of similarity shapes based on landmark configurations, 
geometrical m-dimensional objects (usually m = 2,3) are studied by placing 
k > m landmarks at specific locations of each object. Each object is then de- 
scribed by a matrix in the space M(m, k) of m x k matrices, each of the k 
columns denoting an m-dimensional landmark vector, (x, y) :— tr(xy T ) denotes 
the usual inner product with norm ||x|| = yj (x, x) . For convenience and without 
loss of generality for the considerations below, only centered configurations are 
considered. Centering can be achieved by multiplying with a sub-Helmert ma- 
trix % £ M(k, k — 1) from the right, yieldin g a configuration xW. in M( m, k — 1). 



For this and other centering methods cf. (jDrvden and Mardial . |1998, Chapter 
2). Excluding also all configurations with all landmarks coinciding gives the 
space of configurations 

:= M(m,fc-1)\{0}. 

Since only the similarity shape is of concern, we may assume that all configu- 
rations are contained in the unit sphere Sf n := {x € F k n : ||x|| = 1} called the 
pre-shape sphere. Then, Kendall's shape space is the canonical quotient 

: = S k m /SO{m) = {[x] : x £ S^} with the orbit [x] = {gx : g £ SO(m)} . 

In some applications reflections are also filtered out giving Kendall's reflection 
shape space 

RYt--=V k J{e,e\=S k JO{m). 

Here, 0(m) = {g £ M(m, m) : g T g — e} denotes the orthogonal group with 
the unit matrix e = diag(l, . . . , 1), e = diag(— 1, 1, . . . , 1) and SO(m) — {g £ 
0(m) : det(g) — 1} is the special orthogonal group. 

For 1 < j < m < k consider the isometric embedding 

^ S k m : x ^ (Jf) (1) 

giving rise to a canonical embedding HEj °-> Ef„ which is isometric w.r.t. the 
canonical intrinsic distance, the Procrustean distance and the Ziezold distance, 
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respectively, defined in Section [31 cf. ( Kendall et al. , 19991 p. 29), cf. also 
Remark 12.31 below. 

We say that a configuration in M. m is j -dimensional, or more precisely non- 
degenerate j -dimensional if its preshape x £E S m is of rank j. Moreover, for 
j > 3 the shape spaces E^ and R^j decompose into a manifold part (defined in 
Section [31 cf. also Section [3]) of regular shapes 

(£)')* ={[x]e E* : rank(a;) > j - 1} and (i?E^)* = {[x] G RE,] : rank(a;) = j} , 

respectively, given by the shapes corresponding to configurations of at least 
dimension j — 1 and j, respectively and a non void part of singular shapes 
corresponding to lower dimensional configurations, respectively. 

The following Theorem for intrinsic means, full Procrustes means and Ziezold 
means (also defined in Section [3J) of random elements taking values in RT,j 
follows from Proposition 13. 31 Remark 16.81 and the fact that RY,^ C Ej, contains 
all shapes in Ej^ of configurations of dimension up to j, 1 < j < m < k. 

Theorem 2.1. Suppose that X is a random shape on E^ assumes shapes in 
R^j (1 < j < m < k) with probability one. Then every full Procrustes mean 
shape of X and every unique intrinsic or Ziezold mean shape under Condition 
\1.2\ w.r.t. (i?Ej)* corresponds to a configuration of dimension less than or equal 
to j. 

The following theorem is the application of the key result applied to Kendall's 
shape spaces. 

Theorem 2.2 (Stability Theorem for Intrinsic and Ziezold means). Let X be a 
random shape on E^, < m < k, with unique intrinsic or Ziezold mean shape 
[/j] £ E^j 7 /i € and letl<j<m be the maximal dimension of configurations 
of shapes assumed by X with non-zero probability. Suppose moreover that shapes 
of configurations of strictly lower dimensions are assumed with at most countably 
many point masses. 

(i) If j < m then fi corresponds to a non-degenerate j -dimensional configura- 
tion. 

(ii) If j = m then jj, corresponds to a non-degenerate configuration of dimen- 
sion m — 1 or m. 

Proof. Lemma ETT1 teaches that for Kendall's shape spaces, all quotient cut loci 
are void. Since for Ziezold means, Remark 13.51 provides invariant optimal po- 
sitioning and Remark 14.21 provides the validity of ([S]) , Corollary 14.81 applied to 
i?Ej as well as to E^ states that intrinsic and Ziezold means are also assumed on 
the manifold parts of ^?E^ and E^, respectively. In conjunction with Theorem 
12.11 this gives the assertion. □ 

Remark 2.3. The result of Theorem \2.S\ is sharp. To see this, consider for 
a>(3>0,a 2 +(3 2 = l the pre-shapes 

x ( ° ° M v ( a ° M and z ( 1 ° M G S 4 

I o (3 o h y - { o -(3 o ) and Z ~ I ) e 52 ' 
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Then x and y correspond to non-degenerate two-dimensional quadrilateral con- 
figurations while z corresponds to a one- dimensional (collinear) quadrilateral. 
Still, [z] is regular in £| and it is the intrinsic and Ziezold mean of [x] and [y] 
in Y>\. Under the embedding RT>\ c — >■ £| we have the pre-shapes 
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e s 



Just as [x] = [y] in i?£^ so do x' and y' have regular and identical shape in £3 . 
However, [z 1 ] is not regular and it is not the intrinsic or Ziezold mean in £3. 



3 Fundamental Types of Means 

In the previous section we introduced Kendall's shape and reflection shape 
space based on invariance under similarity transformations and, including re- 
flections, respectively. Invariance under congruence transformations only leads 

to Kendall's size- and- shape space. More generally in image a nalysis, invariance 

may a l so be considered under the affine or projective group, cf. iMardia and Patrangenaru 
(|200ll 120051 ). A different yet also very popular popular set of shape spaces for 
two-dime nsional configurations mo dulo the group of similarities has been intro- 
duced by IZahn and Roskiesl (| 19721 ) . Instead of building on a finite dimensional 
Euclidean matrix space modeling landmarks, the basic ingredient of these spaces 
modeling closed plan ar unit speed curves is the infinite dimensional Hilbert space 



of Fourier series, cf. E assen et al 



(2004). In practice for numerical computa- 
tions, only finitely many Fourier coefficients are considered. 

To start with, a shape space is a metric space (Q,d). For this entire paper 
suppose that X, X\, X%, . . . are i.i.d. random elements mapping from an abstract 
probability space (Q,A,P) to (Q,d) equipped with its self understood Borel a- 
field. Here and in the following, measurable will refer to the corresponding Borel 
er-algebras, respectively. Moreover, denote by E(Y) the classical expected value 
of a random vector Y on a Z?-dimensional Euclidean space M. D , if existent. 

Definition 3.1. For a continuous function p : Q x Q — >■ [0,oo) define the set 
of population Frechet p-means by 

EW(X)=argmin MeQ E(p(X,/i) 2 ). 

For uj 6 O denote the set of sample Frechet p-means by 



E n\u) = argmin^Q ^ p{ X 3^)^ 



By continuity of p, the p-means are closed sets, additionally, sample p-means 
are random sets, all of which may be empty. For our purpose he re, we rely on th e 
definition of ra ndo m closed set s as in troduced and studied by IChoquetl (|1954h . 
iKendalll (1 19 74) a nd Matheron ( 19751) . Since their original definition for p = d 
by iFrechetl (| 19481 ) such means have found much interest. 



G 



Intrinsic means. I ndependently, for a connected Riemannian manifold with 
geodesic distance p^> , iKobavashi and Nomizu (|l969h defined the corresponding 



means as centers of gravity. They are nowadays also well known as intrinsic 



means by iBhattacharva and Patrangenaru (|2003l 120051) 



Extrinsic means. W.r.t. the chordal or extrinsic metric p^ due to an em- 
bedding of a Riemannian manifold in an a mbient Euclidean space, Freche t p- 



means have been c alled mean locations by Hendriks and Landsman! ([19961 ) or 
extrinsic means by IBhattacharva and Patrangenaru! (|2003l ). 

More precisely, let Q = M C MP be a complete Riemannian manifold em- 
bedded in a Euclidean space M. D with standard inner product (x,y), \\x\\ = 
\J {x,x),p^ (x,y) = \\x — y\\ and let $ : M D — > M denote the orthogonal projec- 
tion, $>(x) = argmin peM ||x — p\\. For any Riemannian manifold an embedding 
that is even isometric can be found for D sufficiently large, see lNashl(ll956l ) . Due 
to an extension of Sard's Theorem by (jBhattacharva and Patrangenarul 2003. 
p. 12) for a closed manifold, $ is univalent up to a set of Lebesgue measure zero. 
Then the set of extrinsic means is given by th e set of images <j>(E(Y")) where Y 
denot es X viewed as taking values in R. D (cf. IBhattacharva and Patrangenaru 
( 2003h ). 



Residual means. In this context, setting p^ r '(p,p') = \\d$ p t(p — p')\\ (p,p' € 
M) with the derivative d$ P ' at p' yielding the orthogonal projection to the 
embedded tangent space T p /R D —> T p iM C T p rR D , call the corresponding mean 

sets E^ p( ^(X) and En P (w), the sets of residual population means and residual 
sample means, respectively. Fo r two- s phere s, p^(p,p r ) has been studied under 
the name of crude residuals bv lJupd |l988). On unit-spheres 



p^(p,p') = \\ P - <p,pvii = ^i-{ P ,pr = P (r \p',p) (2) 

is a quasi-metric (symmetric, vanishing on the diagonal p = p' and satisfying 
the triangle inequality). On general manifolds, however, the residual distance 
p( r ' may be neither symmetric nor satisfying the triangle inequality. 

Obviously, for X uniformly distributed on a sphere, the entire sphere is 
identical with the set of intrinsic, extrinsic and residual means: non-unique 
intrinsic and extrinsic means may depend counterintuitively on the dimension 
of the ambient space. Here is a simple illustration. 

Proposition 3.2. Suppose that X is a random point on a unit sphere S ^ 1 
that is uniformly distributed on a unit subsphere S . Then 

(i) every point on S ^ 1 is an extrinsic mean and, 

(ii) if S is a proper subsphere then the set of intrinsic means is equal to the 
unit subsphere S' orthogonal to S . 
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Proof. The first assertion is a consequence of p^ e \x,y) 2 + p^ e \x, —y) 2 = 4 for 
every x,y G S ^ 1 . The second assertion follows from 

2 

p« (x, y) 2 +p^ (x, -y) 2 = p« (x, y) 2 +(ir- P ^ (x, y)) 2 > for every x,y e S^ 1 

for the intrinsic distance p^(x,y) — 2arcsin(|Ja; — y\\/2) with equality if and 
only if x is orthogonal to y. □ 

Proposition 3.3. If a random point X on a unit sphere is a.s. contained in a 
unit subsphere S then S contains every residual mean as well as every unique 
intrinsic or extrinsic mean . 

Proof. Suppose that x = v + v is a mean of X with G S and v G S ^ 1 

normal to S. Since 1 - (X, v + v) 2 = 1 - (X,v) 2 > 1 - {X, v) 2 /\\v\\ 2 a.s. with 
equality if and only if v — 0, the assertion for residual means follows at once from 
([2]l, For intrinsic and extrinsic means we argue with = (v— 

a.s. yielding v — in case of uniqueness. □ 

Let us now incorporate mo re of the structure com mon to shape spaces. The 
following definition is due to (|Kendall et al. . 1999, p. 249). We additionally 



require that the group acting be compact in order to ensure that the quotient 
be Hausdor ff. More gene rally, one could assume a non-compact group acting 
properly^ cf. iPalais (|!96lh . 



Definition 3.4. A complete connected finite- dimensional Riemannian manifold 
M with yeodesic distance g?m on which a compact Lie yroup G acts isometrically 
from the left is called a pre-shape space. Moreover the canonical quotient 

7T : M -> Q := M/G = {[p] : p € M} with the orbit [p] = {gp : g e G} , 

is called a shape space. 

As a consequence of the isometric action we have that <Im{,9PiP') — d{p, Q l p') 
for all p,p' £ M, g G G. For p,p' G M we say that p is in optimal position to p' 
if (1m(p,p') = rnin 56 G ^m (<7P, p'), the m inimum is attained since G is compact. 



As is well known (e.g. (jBredonl I1972L p. 179)) there is an open and dense 



submanifold M* of M such that the canonical quotient Q* = M* /G restricted 
to M* carries a natural manifold structure also being open and dense in Q. 
Elements in M* and Q* , respectively, are called regular, the complementary 
elements are singular; Q* is the manifold part of Q. 



Intrinsic means on shape spaces. The canonical quotient distance 
dQ{[p], [p']) ■= vamd M {gp,p') = min d M {gp,hp') 

g£G g,h£G 

is called intrinsic distance and the corresponding <iQ-Frechet mean sets are 
called intrinsic means. Note that the intrinsic distance on Q* is equal to the 
canonical geodesic distance. 
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Ziezold and Procrustean means. Now, assume that we have an embedding 
with orthogonal projection <J> : M. D — > M C M. D as above. If the action of G is 
isometric w.r.t. the extrinsic metric, i.e. if ||<7p— gp'\\ = \\p — p'\\ for al\p,p' G M 
and g € G then call 



Pq{[p\Ap']) ■= min||gp-p'|| and 



Pq\[pW\) ■■= min \\d%,{gp-p')\\ 

9 6 G, gp in 
opt. pos. to p' 

the Ziezold distance and the Procrustean distance on Q, respectively. Call the 

(z) 

corresponding population and sample Frechet pg -means, respectively, the sets 
of population and sample Ziezold means, respectively. Similarly, call the cor- 
responding population and sample Frechet p^g -means, respectively, the sets of 
population and sample Procrustean means, respectively. 

We say that optimal positioning is invariant if for all p,p' € M and g* G G, 

dM{g*P,p') = mmd M (gp,p') \\g*p-p'\\ = ™%\\gp-p'\\ . 
g&G gEG 

Remark 3.5. Indee d for Q = Ej^,i?Ej^, optimal positioning is invariant (cf. 
i Kendall et al , 199& , p. 206)), Procru stean m e ans c oincide with means of gen- 



eral Procrustes analysis introduced by Gowei \l97& ) and Ziezold 



means coin- 



cide with means as introduced by \Ziezold \l99k ) for E§. Moreover for E*, 
Procrustean means agree with extrinsic mean s w.r.t . the Veronese- Whitney em- 
bedding, cf. Bhattacharya and Patranaenarv] It 200 A ) and Section^ 



Procrustean means on E^ are also called full Procrustes means in the liter- 
ature to distinguish them from partial Procrustes means on t he siz e- and- shape 
spaces not further discussed here (e.g. iDrvden and Mardia (1998)). We only 



note that partial Procrustes means are identical to the respective intrinsic, Pro- 
crustean and Ziezold means which on size-and-shape spaces, all agree with one 
another. 



4 Horizontal Lifting and Manifold Stability 

In this section we derive a measurable horizontal lifting and the stability theorem 
underlying Theorem 12.21 To this end we first recall how a shape space is made 
up from manifold strata of varying dimensions. Unless otherwise referenced, we 
use basic term inology that can be found in any stan dard textbook on differential 
geometry, e.g. iKobavashi and Nomizul ( 1963lTl9 69h . For the results derived here 



we assume that the shape space is a quotient modulo a compact group. We note 
that these results r emain yalid in the more general case of a non-compact group 
acting properly, cf. IPalaid (Il96lh . 
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4.1 Preliminaries 

Assume that Q = M/G is a shape space as in Definition 13.41 T p M is the 
tangent space of M at p £ M and exp p denotes the Riemannian exponential 
at p. Recall that on a Riemannian manifold the cut locus C{p) of p comprises 
all points q such that the extension of a length minimizing geodesic joining 
p with q is no longer minimizing beyond q. In consequence, on a complete 
and connected manifold M we have for every p' £ M that there is v' £ T p M 
such that p' = exp p v' while v' = exp" 1 p' of minimal modulus is uniquely 
determined as long as p' £ M \ C{p). It is well known that the cut locus 
has measure zero in the sense that its image in any local chart has Lebesgue 
measure zero. From now on we call the cut locus the manifold cut locus in order 
to distinguish it from the quotient cut locus C quot (q) of q £ Q which we define 
as C quot {q) :— {[p'\ : p' £ C(p) is in optimal position to some p £ q} . Due to 
the isometric action we have for any p £ q that 

C quot (q) = {\p'\ : p' £ C(p) is in optimal position to p] C 7r(C(p)). (3) 

The following Lemma teaches that in general, the projection of the manifold 
cut locus, the manifold cut locus of the manifold part Q* and the quotient cut 
locus are different. In particular, quotient cut loci are void in the special case 
of Kendall's shape spaces. 

Lemma 4.1. C{q) ^ for every q £ while C quot (q) = for all q £ . 
Similarly C quot (q) = for all q £ RE^ . 

Proof. The first assertion follows from the fact that Yj\ is a compact manifold. 
For the second assertion consider [p] £ S^. Since C(p) = {—p} for p £ and 
[p] = [—p] for even m as well as for odd m if p is not regular, and, since p, —p are 
not in optimal position for odd m if p is regular, we have that C quot {[p]) = 0. 
The third assertion follows from the fact that [p] = [—p] for all [p] £ -RS^. □ 



Next we collect consequences of the isometric Lie group action, see iBredon 



(1972). 



(A) With the isotropy group I p = {g £ G : gp — p} for p £ M, every orbit 
carries the natural structure of a coset space [p] = G/I p . Moreover, p' £ M 
is of orbit type (G/I p ) if I p i = gl p g~ l = I gp for a suitable g £ G. If 
Ip C I gp i for suitable g £ G then p' is of lower orbit type than p and p is 
of higher orbit type than p' . 

(B) The pre-shapes of equal orbit type := {p 1 £ M : p 1 is of orbit type 
(G/I p )} and the corresponding shapes Q^vi :— {[p 1 ] : p' £ M^ IpS> } are 
manifolds in M and Q, respectively. Moreover, for q £ Q denote by 

the shapes of higher orbit type. 

(C) The orthogonal complement H p M in T p M of the tangent space T p [p] along 
the orbit is called the horizontal space: T p M = T p [p] © H p M. 
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(D) The Slice Theorem states that every p € M has a tubular neighborhood 
[p] C U C M such that with a suitable subset D C H p M the twisted 
product exp p D x j p G is diffeomorphic with £/. Here, the twisted product 
is the natural topological quotient of the product space exp p DxG modulo 
the equivalence 

(exp p v,g) (exp v', g') 3h G I p such that v' = dhv, g 1 — gh^ 1 . 

We then say that the tubular neighborhood U admits a slice exp p D via 
U = exp p D x Ip G. 

(E) Every p £ M has a tubular neighborhood U of p that admits a slice exp p D 
such that every p' € exp p D is in optimal position to p. Moreover, for any 
tubular neighborhood U admitting a slice exp p D, all points p' € U are 
of orbit type higher than or equal to the orbit type of p and only finitely 
many orbit types occur in U. If p is regular, i.e. of maximal orbit type, 
then the product is trivial: exp p D x j p G = exp p D x G/I p . 

Finally let us extend the following uniqueness property for the intrinsic dis- 
tance to the Ziezold distance. The differential of the mapping ff nt : M\C(p') — > 
[0, oo) defined by f LAp) = d M (p, exp„ p') 2 is giv en by dfl. Jp) = -2v wi th 



exp-V (cf. jKobavashi and Nomizul . Il969l . p. 110). iKarcherl (|1977l) ) 



Hence, we have for p\,p<i £ M\ C{p) that 

dfZip) = dfZiP) & Pi=P2- (4) 

In view of the extrinsic distance let f£ xt ■ M \ C(p') — > [0, oo) be defined by 

fextip) = \\P ~ P'W 2 = \\P ~ ex P P ( ex Pp 1 P')H 2 - Mimicking (g]) introduce the 
following condition 

dfM=dm t (p) Pi=P2 (5) 

for pi,p 2 G M\C{p). 

Remark 4.2. (0) is valid on closed half spheres since on the unit sphere 



4.2 A Measurable Horizontal Lift 

In order to establish the stability of means in Theorem 14.71 in the following 
Section 14.31 here we lift a random shape X from Q horizontally to a random 
pre-shape Y on M. In order to do so we need to guarantee the measurability 
of the horizontal lift in Theorem 14.41 below, the proof of which can be found in 
the appendix. 

Before continuing, let us consider a simple example for illustration. Suppose 
that G — S 1 C C acts on M — C by complex scalar multiplication. Then 
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[0, oo) = Q = M/G having the two orbit types (S 1 /^) = {1} and (S^h) = S 1 
gives rise to 

Q^o) = {0} and = (0, oo). Obviously, M admits a global 

slice via the polar decomposition [0,oo) x s i S 1 = {0} U ((0, oo) x S* 1 ) = M 
about £ M (the Riemannian exponential is the identity if TqC is identified 
with C). Here, X can be identified with its horizontal lift Y to the global slice 
[0, oo) C M. If, say, X is uniformly distributed on [1, 2] then ¥{X G > 0. 

In this case the stability theorem states the obvious fact that G cannot 
be a mean of X. 

Definition 4.3. Call a measurable subset L C M a measurable horizontal lift 
of a measurable subset R of M/G in optimal position to p G M if 

1. the canonical projection L — > R C M/G is surjective, 

2. every p 1 ^ G L is in optimal position to p G L, 

3. every orbit [p'\ of p' G L meets L once. 

Theorem 4.4. Let p G [p] G Q and A C Q countable. Then there is a measur- 
able horizontal lift L of QwY) y A in optimal position to p. 

Theorem 4.5. Assume that X is a random shape on Q and that there are 
peM and A C Q countable such that X is supported by ( y Q^UA)\C quot ([p\). 
With a measurable horizontal lift L of (Q^'-'uA) \ C quot ([p]) in optimal position 
to p define the random element Y on L C M by 7r o Y = X . 

(i) If [p] is an intrinsic mean of X on Q, then p is an intrinsic mean of Y 
on M and 

E^xp- 1 ^) = 0. 

(ii) If [p] is a Ziezold mean of X on Q and optimal positioning is invariant, 
then p is an extrinsic mean ofY on M and 

E(d/L(p))=0. 

(Hi) If [p] is a Procrustean mean of X on Q and optimal positioning is invari- 
ant, then p is a residual mean ofY on M. 

Proof. Suppose that [p] is an intrinsic mean of X. If p would not be an intrinsic 
mean of Y, there would some Msp'^p leading to the contradiction 

E(d Q (lp'],X) 2 ) = E(d M {p',Y) 2 ) < E(d M (p,Y) 2 ) = E(d Q ([p],Xf). 

(z) 

Hence, p is an intrinsic mean of Y. Replacing c?q by pg and dj^ by the Eu- 
clidean distance gives the assertion for Ziezold and extrinsic means, respectively; 
and, using the Procrustean distance p^Q on Q as well as the residual distance 

(r) 

p M on M gives the assertion for Procrustean and residual means, respectively. 

For intr insic means y G M, the necess ary condition E ( exp~ 1 Y) = is de- 
veloped in ( Kobavashi and Nomizu . 19691 p. 110), cf. also (jKarcher . 19771) and 



12 



( Kendal] , 1990, p. 395) which yields the asserted equality in (i). By definition, 



the analog condition for an extrinsic mean p € M is E(<i/ ( ^, t (p)) = which is 
the asserted equality in (ii) completing the proof. □ 

Remark 4.6. Since the maximal intrinsic distance on Ej^ and is 
— = max min arccos ( ti(qxv T )) = max min arccos ( tr(qxy T )) , 

2 x,yeS^geSO(m) V V " x, y eS^ gSO(m) v w 

taking into account Remark \4-.2[ condition (5p is satisfied for any horizontal lift 
in optimal position. 

4.3 Manifold Stability 

The proof of the following central theorem is deferred to the appendix. 

Theorem 4.7. Assume that X is a random shape on Q, p G M and that 
Ac Q is countable such that X is supported by (Q<-W> U A) \ C quot ([p}) and let 
p' E [p'\ E Q (M) . IfP{X e Q ( V)} ^ and i/ either [p] is 

(i) an intrinsic mean of X or 

(ii) a Ziezold mean of X while optimal positioning is invariant and {5|) is valid, 

then p' is of lower orbit type than p. 

We have at once the following Corollary. 

Corollary 4.8 (Manifold Stability Theorem). Suppose that X is a random 
shape on Q that is supported by Q\ C quot ([p\) for some [p] £ Q assuming the 
manifold part Q* with non-zero probability and having at most countably many 
point masses on the singular part. Then [p] is regular if it is an intrinsic mean 
of X, or if it is a Ziezold mean, optimal positioning is invariant and (0) is valid. 

Since Q \ is a null set in Q for every qeQ (cf. (|Bredonl Il972l p. 184)) 



and so is C quot (q) - by ([3]) it is contained in the projection of a null set - we 
have the following practical application. 

Corollary 4.9. Suppose that a random shape on Q is absolutely continuously 
distributed w.r.t. the projection of the Riemannian volume on M . Then intrinsic 
and Ziezold population means are regular; the latter if optimal positioning is 
invariant and {5|) is valid. And, intrinsic and Ziezold sample means are a.s. 
regular. 

4.4 An Example for Non-Stability of Procrustean Means 

Consider a random configuration Z € assuming the collinear quadrangle q\ 
with probability 2/3 and the planar quadrangle q2 with probability 1/3 where 

1 -1 \ / 1 1-2 

9i = I '92= 7f7f7f-^ 
0000/ VOOO 
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Corresponding pre-shapes in optimal position w.r.t. the action of SO(3) and 
0(3) are given by 




Pi = U U u , vi 




Note that [p 2 ] has regular shape in (£f)*. The full Procrustes mean of [Z] G S3 
is easily computed to have the singular shape [pi] € £3 \ (S3)*, see Figure [5] as 
well as Examples 16.51 and Section [7^1 Cf. also Remark [ 



5 Extrinsic Means for Kendall's (Reflection) Shape 
Spaces 

Let us recall the well known Veronese- Whitney embedding for Kendall's planar 
shape spaces E*. Identify F£ with C fc_1 \ {0} such that every landmark column 
corresponds to a complex number. This means in particular that z G C fe_1 
is a complex row(!)-vector. With the Hermitian conjugate a* = (oftj) of a 
complex matrix a = (djk) the pre-shape sphere S% is identified with {z G 
C fc-i . zz * = jj on which SO ( 2 ) identified with S 1 = {A G C : |A| = 1} acts by 
complex scalar multiplication. Then the well known Hopf-Fibration mapping to 
complex projective space gives = S% / S 1 = CP h ~ 2 . Moreover, denoting with 
M(k — 1, k— 1, C) all complex (fe — 1) X (k — 1) matrices, the Veronese- Whitney 
embedding is given by 

S2/S 1 -> {a G M(k — l,k — 1,C) : a* = a} , [z] t-> z* z . 

Remark 5.1. The Procrustean metric of E| is isometric with the Euclidean 
metric of M[k — 1, k — 1, C) since we have (z, w) = Re(zw*) for z,w G S% and 
hence, d^h[z], [w]) = >/l — wz*zw* = \\w*w — z* z\\/\/2. 

The idea of the Veronese- Whitney embedding can be carried to the general 
case of shapes of arbitrary dimension m > 2. Even though the embedding given 
below is apt only for reflection shape space it can be applied to practical situ- 
ations in similarity shape analysis whenever the geometrical objects considered 
have a common orientation. As above, the number k of landmarks is essential 
and will be considered fixed throughout this section; the dimension 1 < m < k, 
however, is lost in the embedding and needs to be retrieved by projection. To 
this end recall the embedding of Sj in (1 < j < m) in (fTJ) which gives rise 
to a canonical embedding of i?E" 1 in i?Ej^. Moreover, consider the strata 

(RXly := {[x] G RY, k m : rank^) = j}, (Ejj' := {[x] G £*, : rank(x) = j} 

for j = 1, . . . , m, each of which carries a canonical manifold structure; due to 
the above embedding, (iffijjj- 7 will be identified with (RY^y such that 

rn 

RYt = \J(R^) j , 
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and (iJE*,) J with (££,) J ' in case of j < m. At this point we note that SO(m) is 
connected, while 0(m) is not; and the consequences for the respective manifold 
parts, i.e. points of maximal orbit type: 

(str = (^tr- 1 u otr, mir = mlr- w 

Similarly, we have a stratifiction 

k-l 

V := {a e M(k- l,k- 1) : a = a T > 0,tr(a) = l} = [jv j 

3=1 

of a compact flat convex space V with non-flat manifolds V 3 ' :— {a £ : 
rank(a) = j} (j = 1, . . . , k— 1) , all embedded in M{k— 1, fc — 1). The Schoenberg 
map s : — > "P is then defined on each stratum by 

For x £ Sj recall the tangent space decomposition T^S* = T x [x] © H x Sj into 
the vertical tangent space along the orbit [x] and its orthogonal c omplement the 



horizontal tangent space. For x £ (S^y identify canonically (cf. ([Kendall et al 
19991 p. 109)) 



t [x] (keW ^ h x s^ = {we M(j, k-i)-. ti(wx T ) = o,wx T = x W T } . 



Then the assert ion of the following T heorem condenses results of lBandulasiri and Patrangenaru 
(|2005h , cf. also lDrvden et all (|2Q08h . 

Theorem 5.2. Eachs 3 is a diffeomorphism with inverse (s^~)~ l (a) = [(\/Au T ){] 
where a = u\u T with u £ 0(k— 1), A = diag(Ai, . . . , \ m ), o-nd = Xj+% = . . . = 
Afe_i in case of j < k — 1. Here, (a)\ denotes the matrix obtained from taking 
only the first j rows from a. For x £ Sj and w £ H x Sj = Tj- X ] (KSj ) J the 
derivative is given by 

d(s-')[ x ]W — x T w + w T x . 

Remark 5.3. In contrast to the Veronese-Whitney embedding, the Schoenberg 
embedding is not isometric as the example of 

cos 6 \ / sin (A \ / cose 



sin0 J ' Wl \ -cos0 J ' W2 \ sin0 

teaches: \\x T Wi + wfx\\ = \/22| cos<^sin^|, ||x T i«2 + W2 , a;|| = \/2- 

Since V is bounded, convex and Euclidean, the classical expectation K(X T X) £ 
V 3 for some 1 < j < fc — 1 of the Schoenberg image X T X of an arbitrary random 
reflection shape [X] £ is well defined. Then we have at once the following 

relation between the rank of the Euclidean mean and increasing sample size. 
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Theorem 5.4. Suppose that a random reflection shape [X] € is distributed 

absolutely continuous w.r.t. the projection of the spherical volume on Sf n . Then 

n 

E(X T X) e V k ~ x and - V XTX t e 7? min («™^" 1 ) a .s. 

i=l 

for every i.i.d. sample X%, . . . , X n <~ X . 

Hence, in stastical settings involving a higher number of landmarks, a suffi- 
ciently well behaved projection of a high rank Euclidean mean onto lower rank 
V r = (Sjf) r , usually m — r, is to be employed, giving at once a mean shape 
satisfying strong consistency and a CLT. Here, unlike to intrinsic or Procrustes 
analysis, the dimension r chosen is crucial for the dimensionality of the mean 
obtained. 

The orthogonal projection 

V : \Ji=r "P* -> V r , a ^ argmin beP . tr ((a - b) 2 ) 

giving the set of extrinsic Schoenberg means has been computed bv lBhattacharva 
(|2008h : 

Theorem 5.5. For 1 < r < k — 1, a = u\u T e V with u e 0(k - I), A = 
diag(Ai, . . . , A m ), Ai > . . . > Afc_i and X r > the orthogonal projection onto 
V r is given by (jf{a) = u/iu T with \x — diag(/ii, . . . , /i r ,0, ... ,0), 

Mi = A, + - - A r (i = l,...,r) 
r 

and A,- = - 53i=i — r w hi c h * s uniquely determined if and only if A r > A r +i. 

With the notation of Theorem l5.5l a non-orthogonal central pr ojection ijj r (a) = 
uvu T equally well and uniquely determined has been proposed bv lDrvden et al. 
(|2008l) with 

i/ = diag(i/i,...,i/ r ,0,...,0), Vi = (i = l,...,r). 

rA r 

Orthogonal and central projection are depicted in Figure [T] 



6 Local Effects of Curvature 

In this section we assume that a manifold stratum M supporting a random 
element X is isometrically embedded in a Euclidean space M. D of dimension 
D > 0. With the orthogonal projection $ : R-° — > M from Section [3] and the 
Riemannian exponential exp p of M at p we have the 

intrinsic tangent space coordinate expp 1 ^ and the 
residual tangent space coordinate d<& p (X — p) , 

respectively, of X at p, if existent. 
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Figure 1: Projections (if existent) of 
two points (crosses) in the X-plane to 
the open line segment A = {(Ai,A2) : 
Ai + A2 = 1, Ai,Aa > 0}. The dotted 
line gives the central projections (de- 
noted by stars) which is well defined 
for all symmetric, positive definite ma- 
trices ( corresponding to the first open 
quadrant), the dashed line gives the 
orthogonal projection ( circle ) which is 
well defined in the triangle below A 
( corresponding to V) and above A in 
an open strip. In particular it exists 
not for the right point. 



6.1 Finite Power of Tests and Tangent Space Coordinates 

With the above setup, assume that /i € M is a unique mean of X. Moreover, 
we assume that M is curved near fi, i.e. that there is c G M. D , the center 
of the osculatory circle touching the geodesic segment in M from X to n at 
fj, with radius r. If X r is the orthogonal projection of X to that circle, then 
X = X r + 0{\\X - /i|| 3 ). Moreover, with 

x ~ c M-c\ 1 lv „ _ v , „ ntv .,,3. 



-(X r -c,fi-c} + 0(\\X -/i||' 5 ) 



\X — c|| r I r 
we have the residual tangent space coordinate 

v = X — c — — || X — c\\ cos a = X r — c — (/j, — c) cos a + 0(\\X — /i|| 3 ) 

r 

having squared length ||?;|| 2 = r 2 sma 2 + 0(\\X — /^l) 3 )- By isometry of the 
embedding, the intrinsic tangent space coordinate is given by 

IMI 

With the component 

fjj — c 

v = u — c — \\X — ell cosa = (u — c)(l — cosa) + OfllAT — /zll 3 ) 

r 

of X normal to the above mentioned geodesic segment of squared length ||^|| 2 = 
r 2 (l-cosa) 2 + 0(||A-^|| 3 ), we obtain || exp^ 1 AT|| 2 = \\v\\ 2 + \\v\\ 2 + 0(\\X - 
/lx || 3 ) , since 

4 

(1 - cosa) 2 + sin 2 a = 2(1 -cosa) = a 2 + 2— H 

and a — 0(\\X ~ In consequence we have 
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Remark 6.1. In approximation, the variation of intrinsic tangent space coor- 
dinates is the sum of the variation \\v\\ 2 of residual tangent space coordinates 
and the variation normal to it. In particular, for spheres 

Hexp^XH 2 > N| 2 + |M| 2 . 

Since the variation in normal space is irrelevant for a two-sample test for equal- 
ity of means, say, a higher power for tests based on intrinsic means can be 
expected when solely residual tangent space coordinates obtained from an iso- 
metric embedding are used rather than intrinsic tangent space coordinates. 

Note that the natural tangent space coordinates for Ziezold means are resid- 
ual. 

A simulated classification example in Section [7] illustrates this effect. 



6.2 The 1:3- Property for Spherical and Kendall Shape 
Means 

In this section M = 5 D_1 C M. D is the (D — l)-dimensional unit-hypersphere 
embedded isometrically in Euclidean Z?-dimensional space. The orthogonal pro- 
jection $ : M. D —> S ^ 1 : p — > u^ij- is well defined except for the origin p = 0, 

and the normal space at p G S ^ 1 is spanned by p itself. In consequence, a 
random point X on S ^ 1 has 

d$ p (X-p) =X-pcosa, ex Pp 1 (X) = |-"« pV y > im x L p 

as residual and intrinsic, resp., tangent space coordinate at —X / p £ S ^ 1 
where cosa = (X,p), a € [0,7r). 

Theorem 6.2. If X a.s. is contained in an open half sphere, it has a unique 
intrinsic mean which is assumed in the interior of that half sphere. 

Proof. Below, we show that every intrinsic mean necessarily lies within the in- 
terior of the half sphere. Then, (jKendalll . ll99Ct Theorem 7.3) yields unique- 
ness. W.l.o.g. let X = (sine/), x%, . . . , x n ) such that P{sin</> < 0} = = 
1 — P{sin0 > 0} and assume that p = (sm^,p2; ■ ■ ■ >Pn) € S ^ 1 is an intrinsic 
mean, — ir/2 <<f>,ip < n/2. Moreover let p' — (sin(|V'|),P2, ■ ■ ■ >Pn)- Since 

Edlcxpp 1 ^)!! 2 ) = E(arccos 2 (p,X)) 

(A 

= E arccos sin ip sin 
> Efllexp", 1 ^)! 
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with equality if and only if sin \ip\ — sin^, this can only happen for sinV> > 0. 
Now, suppose that p = {0,p2, ■ ■ ■ ,p n ) is an intrinsic mean. For small determin- 
istic ip >0 consider p(if>) = (sin^,Pi cos tp, ... ,p n cosi/j). Then 



E 



^|| exp p ^(X)\\ 2 ^j — E | arccos 2 | sin ip sin cf> + cos tjj ^ pjXj 

V V *= 2 

= Edlexp; 1 ^)!! 2 )-^^^^^ 2 ) 

with C\ > since P{sin0 > 0} > 0. In consequence, p cannot be an intrinsic 
mean. Hence, we have shown that every intrinsic mean is contained in the 
interior of the half sphere. □ 

Remark 6.3. For the special case of sphere s, this is a simple proof for the 
general t heorem recently e stabli shed bv \Afsari if 201(a) which extends results of 



Karche\ \l9li ): \Kendal\ \l99& ) and \Lfj n200i . \2004 )- stating that the intrin- 



sic mean on a general manifold is unique if among others the support of the 
distribution is contained in a geodesic half ball. 

The following theorem characterizes the three spherical means. 

Theorem 6.4. Let X be a random point on S ^ 1 . Then x^ e ' G S ^ 1 is the 
unique extrinsic mean if and only if the Euclidean mean E(X) — J s n-i X dfx 
is non-zero. In that case 

A (e) .x {e) = EpQ 

with X (e) = ||E(X)|| > 0. Moreover, there are suitable > and X® > 
such that every residual mean x^ € S ^ 1 satisfies 

A (r) a; w =E({X,x ( - r) )X) , 
and every intrinsic mean x^ G S ^ 1 satisfies 



v/l-(X,a;W) 



2 



In the last case we additionally require thatE ^ c i c _°^ i < ) ^ (X,x^)J > which 
is in particular the case if X is a.s. contained in an open half sphere. 



Proof. The assertions for the extrinsic mean are well known from lHendriks et al 



(1996). The second assertion for residual means follows from minimization of 

\\p - (p,x)x\\ 2 dP x {p) = l- { (p 7 x) 2 dP x {p) 



n-i 



with respect to x G M. D under the constraining condition ||x|| = 1. Using a 
Lagrange ansatz this leads to the necessary condition 

(p,x)pdF x (p) = Xx 

S D-1 
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Figure 2: Means on a circle of a dis- 
tribution taking the upper dotted value 
with probability 1/3 and the lower right 
dotted value with probability 2/3. The 
latter happens to be one of the two 
residual means. 



with a Lagrange multiplier A of value K((X, x) 2 ) which is positive unless X is 
supported by the hypersphere orthogonal to x. In that case, by Proposition ^. 31 
x cannot be a residual mean of X, as every residual mean is as well contained 
in that hypersphere. Hence, we have A (r) := A > 0. 
The Lagrange method applied to 

/ ||exp- 1 (p)|| a dP x (p)= / axccos 2 ((p,x))dF x (p) 
Js - 1 Js - 1 

taking into account Theorem 16. 2[ insuring that a;W is in the open half sphere 
that contatins X a.s., yields the third assertion on the intrinsic mean. □ 

Recall that residual means are eigenvectors to the largest eigenvalue of the 
matrix E(XX T ). As such, they rather reflect the mode than the classical mean 
of a distribution: 

Example 6.5. Consider 7 € (0, n) and a random variable X on the unit circle 
{e 10 : 9 £ [0, 2ir)} which takes the value 1 with probability 2/3 and e %1 with prob- 
ability 1/3. Then, explicit computation gives the unique intrinsic and extrinsic 
mean as well as the two residual means 

(i) iZ (e) i arctan „ f 111 1 (r) i ik arctan Q l ln ^ 2 /?? , 

x y ' = e 3 , x y ' — e 2+cos 7 ; x y ' = ± e 2 2+co 8 (2 T ) _ 

Figure [H shows the case 7 = -| . 

In contrast to Figure^ one may assume in many practical applications that 
the mutual distances of the unique intrinsic mean x^\ the unique extrinsic 
mean x^ and the unique residual mean x^ r °^ closer to x^ are rather small, 
namely of the same order as the squared proximity of the modulus ||E(X)|| of 
the Euclidean mean to 1, cf. Tabled We will use the following condition 

\\x^ -x^W , ||x( e )-a;W|| = 0((1 - || E(X)||) 2 ) (7) 

with the concentration parameter 1 — || E(X)||. 

Corollary 6.6. Under condition Q), if all three means are unique, then the 
great circular segment between the residual mean x( r °) closer to the extrinsic 
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mean x^ and the intrinsic mean x^ is divided by the extrinsic mean in ap- 
proximation by the ratio 1:3: 

/ \ \\E(X)\\ f , , E((X - x ie \X) X) , „,\ 

= S^r } - to j +Q(d-ii^)ii) 2 ) 



|E(X)|| 



( , I E((X - x<- e \ X) X) „ , ' 



AW I 3 ||E(X)| 

wii/i A*-^ and A^ 1 *^ from Theorem \6.4\ 

Proof. For any x,p G decompose p — x = p — (x,p) x — z(x,p)x with 

z{x,p) = 1 — (x,p), the length of the part of p normal to the tangent space at 
x. Note that E [z{x^ e \X) = 1 - || E(X)||. Now, under condition 0, verify the 
first assertion using Theorem 16.41 

X ™ = ^(E(X)-E(z(x^\X)X 

On the other hand since 



arccosfl — z) 1 2 9 



v/l-(i-^) 2 3 15 

we obtain with the same argument the second assertion 

w = 1 /arccos^.xW) \ 
A« y v/l-^^W) 2 / 

= ± (E(I) + iE(^U)l)+AE(z( I («),X) 2 x)+...j 

- ffiM + E (;:,•.. Y, X) + 0((1 - || K W ||) a 

□ 

Remark 6.7. TTie tangent vector defining the great circle approximately con- 
necting the three means is obtained from correcting with the expected normal 
component of any of the means. As numerical experiments show, this great 
circle is different from t he fir st principal component geodesic as defined in 



Huckemann and Ziezold {200 A) . 



Recall the following connection between top and quotient space means, cf. 
Theorem 14.51 

Remark 6.8. Let p £ such that a random shape X on is supported by 
(£^)(M) U A with A C S^j at most countable. By Lemma \4-1\ and Theorem \4-5[ 
(E*,)^]) U A admits a horizontal measurable lift L C Sf n in optimal position to 
p E Sj n . Define the random variable Y on L C S!f n by iroY = X . Then we have 
that 
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if [p] is an intrinsic mean of X then p is an intrinsic mean of Y , 

if [p] is a full Procrustean mean of X then p is a residual mean of Y , 

if [p] is a Ziezold mean of X then p is an extrinsic mean ofY. 

In consequence, Corollary 16.61 extends at once to Kendall's shape spaces. 
Generalized geodesies referred to below a re an extension o f the c oncept of 
geodesies to non-manifold shape spaces, cf. Huckemann et al. ( 2010bl ). 

Corollary 6.9. Suppose that a random shape X on with unique intrinsic 
mean /x^, unique Ziezold mean and unique Procrustean mean (j,( p > is sup- 
ported by R= (S^)^' 1 '' n (££J^ (2) ) n (E£jO (P '). // the means are sufficiently 
close to each other in the sense of 



0((1-||E(F)||) 2 



with the random pre-shape Y on a horizontal lift L of R defined by X = tt o Y , 
then the generalized geodesic segment between (iW and fj,^ is approximately 
divided by fj,^ by the ratio 1 : 3 with an error of order 0((1 — || E(F)||) 2 ). 

7 Examples: Exemplary Datasets and Simula- 
tions 



All of the results of this section are based on datasets and simulations, i.e., all 
means considered are sample means. An R-package for t he computation of a ll 
means including the poplar leaves data can be found under iHuckemann (|2010bl) . 



7.1 The 1 : 3 property 

In the first example we illustrate Corollarv l6.9l on the basis of four classical data 
sets: 

poplar leaves: contains 104 quadrangular planar shapes extracted from poplar 

leaves in a joint collaboration with Institu te for Forest Biomet r y and Infer- 

matics at the University of Gottingen, cf . iHuckemannl (|2010bh ; IHuckemann et al 
(l2010ah . 



digits '3': contains 30 planar s hapes with 13 landmarks e ach, extracted from 
handwritten digits '3', cf. (|Drvden and Mardial . 1 19981 p. 318). 



macaque skulls: contains t hree-dimensional shapes wi th 7 landmarks each, of 
18 macaque skulls, cf. ( Drvden and Mardia , 1998, p. 16). 



iron age brooches: c ontains 28 th ree-dimensional tetrahedral shapes of iron 
age brooches, cf. ( Small . Il996l Section 3.5). 



As clearly visible from Figure [3] and Tabled the approximation of Corollary 
6.91 for two- and three-dimensional shapes is highly accurate for data of little 
dispersion (the macaque skull data) and still fairly accurate for highly dispersed 
data (the digits '3' data). 
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means of 104 poplar leaf shapes 



means of 30 digits '3' 
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-0.0020 
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Figure 3: Depicting shape means for four typical data sets: intrinsic (star), 
Ziezold (circle) and full Procrustes (diamond) projected to the tangent space at 
the intrinsic mean. The cross divides the generalized geodesic segment joining 
the intrinsic with the full Procrustes mean by the ratio 1:3. 



data set 


m. m. m 


(1-||E(Y)||) 2 


poplar leaves 
digits '3' 
macaque skulls 
iron age brooches 


6.05e - 05 1.83e - 04 2.44e - 04 
0.00154 0.00452 0.00605 

1.96e - 05 5.89e - 05 7.85e - 05 
0.000578 0.001713 0.002291 


5.24e - 05 
0.00155 
7.59e - 06 
0.000217 



Table 2: Mutual shape distances between intrinsic mean ffi' , Ziezold mean jjS z ^ 
and full Procrustes mean fj,^ for various data sets. Last column: the concen- 
tration parameter from cf. also Corollary \6.9i 



7.2 "Partial Blindness" of full Procrustes and Schoenberg 
Means 

In the second example we illustrate an effect of "blindness to data" of full 
Procrustes means and Schoenberg means. The former blindness is due to the 
affinity of the Procrustes mean to the mode in conjunction with curvature, the 
latter is due to non-isometry of the Schoenberg embedding. While the former 
effect occurs only for some highly dispersed data when the analog of condition 
([7]) is violated, the latter effect is local in nature and may occur for concentrated 
data as well. 
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q1 



q1 



q2 



intrinsic 
mean 



Ziezold 
mean 



Procrustes 
mean 



Figure 4: A data set of 
three planar triangles ( top 
row) with its correspond- 
ing intrinsic mean (bot- 
tom left), Ziezold mean 
(bottom center) and full 
Procrustes mean (bottom 
right). 



Reenacting the situation of Section I4.41 cf. also Example I6.5I and Figure [3J 
the shapes of the triangles 91 and 92 m Figure 0] are almost maximally remote. 
Since the mode 91 is assumed twice and 92 only once, the full Procrustes mean 
is nearly blind to 92. 



qi 



q2 

— - =ro 



intrinsic 
mean (q1,q2) 



Schoenberg 
mean (q1,q2) 



intrinsic 
mean (q1,q2,q) 



Schoenberg 
mean (q1,q2,q) 



Figure 5: Planar triangles 
91 = x cos (3 — u>2 sin (3, 
q2 = x cos (3 + W2 sin (3 
and q = x cos (3 + w± sin j3 
with x,W\,W2 from Re- 
markUPA cp = 0.05, (3 = 
0.3 (top row). Intrin- 
sic means (middle row) of 
sample (91,(72) (left) and 
(91,92,9) (right). Schoen- 
berg means (bottom row) 
of sample (91,92) (left) 
and (91,92,9) (right). 



Even though Schoenberg means have been introduced to tackle 3D shapes, 
the effect of "blindness" can be well illustrated already for 2D. To this end con- 
sider x = x{4>), wi — w\{4>) and W2 = W2{4>) as introduced in Remark l5.3l Along 
the horizontal geodesic through x with initial velocity W2 we pick two points 
91 = x cos (3 + W2 sin (3 and 92 = x cos j3 — W2 sin (3. On the orthogonal horizontal 
geodesic through x with initial velocity w\ pick 9 = cccos/3' + w\ sin/3'. Recall 
from Remark 1 5. 3 1 that along that geodesic the derivative of the Schoenberg em- 
bedding can be made arbitrarily small for (f> near 0. Indeed, Figure [5] illustrates 
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that in contrast to the intrinsic mean, the Schoenberg mean is "blind" to the 
strong collinearity of q\ and q2- 

7.3 Discrimination Power 

In the ultimate example we illustrate the consequences of the choice of tangent 
space coordinates and the effect of the tendency of the Schoenberg mean to in- 
crease dimension by a classification simulation. To this end we apply a Hotelling 
T 2 -test to discriminate the shapes of 10 noisy samples of regular unit cubes from 
the shapes of 10 noisy samples of pyramids with top section chopped off, each 
with 8 landmarks, given by the following configuration matrix 

(0 1 i£ ifs 1 if i?\ 

o o ^ ifs i i % i±H 

\ e e e e J 

(cf. Figure [5]) determined by e > 0. In the simulation, independent Gaussian 
noise is added to each landmark measurement. Table |3] gives the percentages of 
correct classifications. 



Figure 6: cube (left) and pyramid of varying height e (right) for classification. 





intrinsic mean with 


Ziczold 


Schoenberg 


e 


intrinsic residual 


mean 


mean 




tangent space coordinates 






0.0 


70 % 74 % 


74% 


64% 


0.2 


56 % 58 % 


57% 


51% 


0.3 


41 % 42 % 


42% 


42% 



Table 3: Percentage of correct classifications within 1,000 simulations each of 10 
unit-cubes and 10 pyramids determined by e (which gives the height), where each 
landmark is independently corrupted by Gaussian noise of variance a 2 = 0.2 via 
a Hotelling T 2 -test for equality of means to the significance level 0.05. 

As visible from Table [3j discriminating flattened pyramids (e = 0) from 
cubes (e = 1) is achieved much better by employing intrinsic or Ziezold means 
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rather than Schoenberg means. This finding is in concord with Theorem 15.41 
samples of size 10 of two-dimensional configurations yield Euclidean means a.s. 
in V 1 which are projected to V 3 to obtain Schoenberg means in E|. In con- 
sequence, Schoenberg means of noisy nearly two-dimensional pyramids are es- 
sentially three dimensional. With increased height of the pyramid, (e > 0, i.e. 
for more pronounced third dimension and increased proximity to the unit cube) 
this effect waynes and all means perform equally well (or bad). Moreover in 
any case, intrinsic means with intrinsic tangent space coordinates qualify less 
for shape discrimination than intrinsic means with residual tangent space co- 
ordinates, cf. Remark 16.11 The latter (intrinsic means with residual tangent 
space coordinates) are better or equally well behaved as Ziezold means (which 
naturally use residual tangent space coordinates). 



intrinsic mean 


Ziezold mean 


Schoenberg mean 


0.24 


0.18 


0.04 



Table 4: Average time in seconds for the computation of means in £§ of sample 
size 20 on a PC with a 800 MHZ CPU based on 1, 000 repetitions. 

In conclusion we record the time for the computations of means in Table 
|4j While Ziezold means compute in approximately 3/4 of the computational 
time for intrinsic means, Schoenberg means are obtained approximately 6 times 
faster. 

8 Discussion 

By establishing stability results for intrinsic and Ziezold means on the manifold 
part of a shape space, a gap in asymptotic theory for general non-manifold shape 
spaces could be closed, now allowing for multi-sample tests of equality of intrinsic 
means and Ziezold means. A similar stability assertion in general is false for 
Procrustean means for low concentration. There is reason to believe, however, 
that it would be true for higher concentration. Note that the argument applied 
to intrinsic and Ziezold means fails for Procrustean means, since in contrast 
to the equations in Theorem 14.51 the sum of Procrustes residuals is in general 
non-zero. Loosely speaking, the findings on dimensionality condense to 

- Procrustean means may decrease dimension by 2 or more, 

- intrinsic and Ziezold means decrease dimension at most by 1, in particular, 
they preserve regularity, 

- Schoenberg means tend to increase up to the maximal dimension possible. 

Due to the proximity of Ziezold and intrinsic means on Kendall's shape 
spaces in most practical applications, taking into account that the former are 
computationally easier accessable (optimally positioning and Euclidean aver- 
aging in every iteration step) than intrinsic means (optimally positioning and 
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weighted averaging in every iteration step), Ziezold means can be preferred over 
intrinsic means. They may be even more preferred over intrinsic means, since 
Ziezold means naturally come with residual tangent space coordinates which 
may allow in case of intrinsic means for a higher finite power of tests than 
intrinsic tangent space coordinates. 

Computationally much faster (not relying on iteration at all) are Schoen- 
berg means which are available for Kendall's reflection shape spaces. As a 
drawback, however, Schoenberg means, seem less sensitive for dimensionality 
of configurations considered than intrinsic or Ziezold means. In particular for 
problems involving small sample sizes n and a large number of parameters p as 
currently of high interest in statistcal applications, involving (nearly) degener- 
ate data, Ziezold means may also be preferred over Schoenberg means due to 
higher power of tests. 

Finally, note that Ziezol d means may be defined for the shape spaces of 
planar curv es introduced by IZahn an d Roskics (Il972h. which are currently of 
interest e.g. iKlassen et al. ( 2004 ) or Schmidt et al. ( 20061 ). Employing Ziezold 
means there, a computational advantage greater than found here can be ex- 
pected since the computation of iterates of intrinsic means involves computa- 
tions of geodesies which themselves can only be found iteratively. 
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A Proofs 

Lemma A.l. Let U C M be a tubular neighborhood about p G M that admits a 
slice via exp p D x j p G = U in optimal position to p. Then, there is a measurable 
horizontal lift L C exp p D of tt(U) in optimal position to p. 

Proof. If p is regular, then L = exp p D has the desired properties. Now assume 
that p is not of maximal orbit type. W.l.o.g. assume that D contains the 
closed ball B of radius r > with bounding sphere S = dB and that there 
are p 1 ,...,p J £ exp p (S') having the distinct orbit types orccuring in S. 5 J 
denotes all points on S of orbit type (G/I p j), j = 1, . . . , J, respectively. Observe 
that each is a manifold on which I p acts isometrically. Hence for every 
1 < j < J, there is a finite (Kj < oo) or countable [Kj — oo) sequence of 
tubular neighborhoods U J k C S 3 covering , admitting trivial slices via 




Here, exp 5 , 3 denotes the Riemannian exponential of S J ■ Defining a disjoint 
sequence k 

U{ := Ul, Ui +1 := Ul +1 \ U£ for 1 < k < K 3 1 
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exhausting S J we obtain a corresponding sequence of disjoint measurable sets 

exp s . 3 Di with 
Pi L 

e^ s J h D{xI p /I pi ^Ul l<k<K s . 

Hence, setting 

L{ := expg D{ and V := j tf fc 

fe=i 

observe that every p' G has a unique lift in U which is contained in a unique 
L k . This lift is by construction (all L k are in exp p D) in optimal position to 
p. Moreover, if p' € L k and gp' € for some g £ G with 1 < fc', fc < -Kj we 
have by the disjoint construction of U 3 k and C/jJ, that k — k' , hence the isotropy 
groups of gp' and p' agree, yielding gp' — p' . In consequence, L J is a measurable 
horizontal lift of S J in optimal position to p. Since every horizontal geodesic seg- 
ment 1 1 — y exp p (to), v € H p M contained in exp p 7? features a constant isotropy 
group, except possibly for the initial point we obtain with the definition of 

L := U/ =1 AF with M j := {exp p (» <E exp p (D) : v € exp~ t > 0} 

a measurable horizontal lift of tt(U) in optimal position to p. □ 

Proof of Theorem 14.41 Since M is connected, any two points p,p' can be 
brought into optimal position p, gp' and a closed minimizing horizontal geodesic 
segment 7 9p / between p, gp' can be found. If [p'] e then also 7 9P ' C 

In consequence, there are tubulars neighborhoods U p of p and t/ p ' of 7 9P ' 
admitting slices in optimal position to p, which by Lemma lA.ll have horizontal 
lifts L p and L p > in optimal position to p. Since M is a manifold, there is a 
sequence [po], ■ ■ ■ € gj G G,p,, e M such that po = P and that each g^p^ 

is in optimal position to p (j £ J, J C N) and such that 

Q<M> c |J 7r(Z7 p ,) 

JGJU{0} 

with measurable horizontal lifts L Pj of Tt(U Pj ). Defining L' po := L po and re- 
cursively L' p := L Pj+1 \ L'p. for j = 1, . . . a measurable horizontal lift L' := 
U^gLp^ of in optimal position to p is obtained. Finally, suppose that pj is 
in optimal position to p for pj € [p^] € A and set Lq : ~ -^j' := ^j'-i U fe i ^ 
[p_j] flL - =0 and L" = L'-_ 1 (j > 1) otherwise to obtain the desired measurable 
horizontal lift L" := U^jg^L" in optimal position to p. 

Proof of Theorem 14.71 In case of intrinsic means, with the hypotheses and 
notations of the above proof of Theorem 14.41 suppose that L" is a measurable 
horizontal lift of Q"P\) u A in optimal position to an intrinsic mean p £ M of 
the random element Y on M defined as in Theorem E3] with [p] e E<- d Q\X). 
For notational simplicity we assume that Q ( [ p l> = tt(U) with a single tubular 
neighborhood U of p admitting a slice. 
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Then, additionally using the notation of the above proof of Lemma IA.11 if 
the assertion of the Theorem would be false, w.l.o.g. there would be g € I p , 
1 < j < J, p 3 £ S j with gp 3 7^ p 3 and P{Y £ M j } > 0. In particular, in the 
proof Lemma IA.11 we may choose a sufficiently small Ul around pj such that 
in consequence of ((H) 

f (exp- 1 Y-exp- 1 (gY))dF Y + (8) 

with some e,r > 0, M 3 k {e) :— {exp p (tv) £ exp p (D) : v £ cxp~ 1 (L 3 k ), \t — r\ < e} 
and L J k obtained from U k as in the proof of Lemma IA.11 Suppose that L C L" 
is obtained as in the proof of Lemma IA.11 by using L k and suppose that L" is 
obtained from L" by replacing the M k {e) part of M k with {exp p (tv) £ exp p (Z?) : 

v £ exp p 1 (gL k ), \ t — r\ < e}. Then L" is also a measurable horizontal lift in 
optimal position to p. Since we assume that [p] is an intrinsic mean of X, 
assertion (i) of Theorem 14.51 teaches that p is also an intrinsic mean of lift Y of 
XtoL", i.e. 

= / exp^YdFy- I exp^Ydfy 

J L" J L" 

= [ (ex P ; 1 y-ex P? ; 1 (. g r))dP y . 

This is a contradiction to © yielding the validity of the theorem for intrinsic 
means. 

The assertion in case^ of Ziezold means is similarly obtained. Use the same 
horizontal lifts L" and L" from above, replace exp" 1 Y, exp" 1 Y and exp~ 1 (gY) 

by df]f xt (p), dfextip) an< ^ ^fext(p)> respectively, use the hypothesis ([5]) to obtain 
the analog of (JSJ) and finally obtain the contradiction arguing with assertion (ii) 
of Theorem 14.51 
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On the Meaning of Mean Shape 

Stephan Huckemann 



Abstract 

A stability result for intrinsic means on the quotient due to an isomet- 
ric and proper Lie group action on a Riemannian manifold is derived, stat- 
ing that intrinsic means are contained in the highest dimensional manifold 
stratum assumed with non-zero probability. In consequence, the Central 
Limit Theorem (CLT) for manifold valued random elements can be ex- 
tended to non-manifold shape spaces. The relationship to other types of 
means is disussed: the CLT extends to Ziezold means but not in general 
to Procrustes means, since the latter may be contained in lower dimen- 
sional manifold strata. In contrast, Schoenberg means tend to increase 
dimension. The finite power of tests may increase when switching from 
intrinsic tangent space coordinates to residual tangent space coordinates. 
This and computational speed makes Ziezold means most competitive for 
multi-sample test. Curiously, intrinsic, Ziezold, and Procrustes means are 
related in such that in approximation the Ziezold means divides the gen- 
eralized geodesic segment between intrinsic and Procrustes mean by the 
ratio 1 : 3. Classical data-sets as well as numerical simulations illustrate 
the findings. 
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1 Introduction 



The analysis of shape may be counted among the very early activities of mankind; 
be it for representation on cultural artefacts, or for morphological, biological 
and medical application, to name only two of its driving forces. In modern 
days shape analysis is gaining increased momentum in computer vision, im- 
age analysis, biomed icine and many o ther fields. For a fairly recent overview 
over key aspects cf. iKrim and Yezzil ([20061 ) . To begin with, already the con- 
cept of an expected, averaged or mean shape is surprisingly non trivial and not 
at all canonical. This is due to the fact that spaces of shapes usually admit 
several natural structures, and, in many cases none of the m is that o f a Eu - 
clidean space. While some structures are Euclidean (e.g. Mosimannl ( 197Cl[ ). 



Hobolth et al. ( 2002 ). Hotz et al. ( 2010() ). st ructures that may be c o nsidered 
benign are that of Riemanian manifolds (e.g. Blum and Nagel ( 19781) . Kendall 
(1984)). Many shape space, however, admit only the structure of a Riemannian 
manifold - the pre-shape space of configurations - modulo the isometric ac- 
tion of a Lie group, conveying shape equivalence. We call the quotient a shape 
space. It carries a canonical quotient structure of a union of manifold strata 
of different dimensions, which give in general a Riemannian manifold - the 
manifold part - with singularities at some of which the curvature may tend to 
infinity. Pr ominent examp l e are Kendall's three- and higher-dimensional shape 



cf. Kendall et al. (1999), and the spaces of closed planar curves, see 



spac es, _ 

IZahn and Roskiesl (|l972^ 

In a Euclidean space, there is a clear and unique concept of a mean in terms of 
least squares minimization: the arithmetic avarage. Generalizing to manifolds, 
however, with every embedding in a Euclidean space come specific concepts 
of extrinsic means and residual means, and with every manifold structure a 
specific concept of an intrinsic mean. Carrying statistics over to manifolds, 
strong consistency and central limit th eorem s (CL T s) for extrinsic, residual 
and intrinsic means have been derived by JuppI (Il988 ^_ Hendriks and Landsman 
( 19961 119981) as well as bv lBhattacharva and Patrangenarul (|2003l . l2005l ). Under 
quoticnting these means generalize to Ziezold means and Procrustes means, 
respectively, and again, to intrinsic means. For a CLT to hold, obviously, a 
manifold struc ture is i nevit able. Due to strong consistency, which has been 
established bv lZiezoldl(ll977l) on general quasi-metrical spaces, for a one-sample 
test for a specific mean shape on the manifold part, it may be assumed that 
sample means eventually lie on the manifold part as well, thus making the above 
CLT available. Multi-sample tests, however, could not be theoretically justified 
because it remained unclear whether means of random shapes on the manifold 
part come to lie on the manifold part again. 

As the key result in this paper it is shown that intrinsic and Ziezold means lie 
on the manifold part whenever the manifold part is assumed with non-zero prob- 
ability. In consequence the CLT for manifolds can be applied in general, thus 
making multi-sample tests for these mean shapes possible. We note that specifi- 
cally for the non-manifold Kendal l reflection shape spaces, Schoenberq m eans as 
have been recently i ntrodu ced by iBandulasiri and Patrangenarul ( 20051 ) as well 
as bv lDrvden et al. ( 20081 ) also allow for asymptotic inference. 

This paper is structured as follows. In motivation in Section [21 Kendall's 
shape spaces are introduced along with the specific application of the key result. 
The following Section [3] lays out the various concepts of means in a generic shape 
space setting. In Section [4] the stability theorem, namely that 

intrinsic means preserve dimension 

is established. The proof is based on lifting a distribution on the shape space to 
the pre-shape space and subsequently exploiting the fact that intrinsic means are 
zeroes of an integral involving the Riemann exponential. The similar argument 
can be applied to Ziezold means but not to Procrustes means. The population 
case is more intrigued, however, as the lifting requires the concept of tubular 
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neighborhoods admitting slices. In Section [SJ extrinsic means are discussed, in 
particular that Schoenberg means are due to a non-isometric embedding and 
that they tend to "increase the dimension" . Section [5] tackles local effects of 
curvature: Section 16.11 motivates why residual tangent space coordinates may 
yield a higher finite power than intrinsic tangent space coordinates. The closeup 
on spherical means in Section \6 . 2 1 reveals that, if the following means are unique, 
the generalized geodesic segment between the intrinsic mean and the Procrustes 
mean (closer to the data) is in approximation divided by the Ziezold mean by 
the ration 1:3. Section [7] illustrates the practical effects of the theoretical 
results using classical data-sets as well as simulations. 



2 Motivation: Kendall's Shape Spaces 

In the statistical analysis of similarity shapes based on landmark configurations, 
geometrical m-dimensional objects (usually m = 2,3) are studied by placing 
k > m landmarks at specific locations of each object. Each object is then de- 
scribed by a matrix in the space M(m, k) of rn x k matrices, each of the k 
columns denoting an m-dimensional landmark vector, (x, y) :— tr(xy T ) denotes 
the usual inner product with norm ||a;|| = \J (x, x) . For convenience and without 
loss of generality for the considerations below, only centered configurations are 
considered. Centering can be achieved by multiplying with a sub-Helmert ma- 
trix T~L G M (k, k— 1) from the right, yieldin g a configuration xT-L in M( w, fc — 1). 



For this and other centering methods cf. iDrvden and Mardial (|1998l Chapter 
2). Excluding also all configurations with all landmarks coinciding gives the 
space of configurations 

Ft := M(m,fc-1)\{0}. 

Since only the similarity shape is of concern, we may assume that all configu- 
rations are contained in the unit sphere := {x € M(m,k — 1) : ||.t|| = 1}. 
Then, Kendall's shape space is the canonical quotient 

£^ := S^/SO(m) = {[x] :ieSj,} with the fiber [x] = {gx : g G SO{m)} . 

In some applications reflections are also filtered out giving Kendall's reflection 
shape space 

En k m :=V k m /{e,e}=Si/0(m). 

Here, 0(m) = {g € M(m, m) : g T g = e} denotes the orthogonal group with 
the unit matrix e = diag(l, . . . , 1), e = diag(— 1, 1, . . . , 1) and SO(m) — {g G 
0(m) : det(g) — 1} is the special orthogonal group. 

For 1 < j < m < k consider the (non-unique) embedding 

^ S k m : x ^ (-^\ (1) 

giving rise to a unique and canonical embedding of ^ which is isometric 
w.r.t. the canonical intrinsic distance, the Procrustes distance and the Ziezold 
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distance defined in Section [31 cf. Kendall et al. (1999, pp. 29, 206). In conse- 



quence, intrinsic means, Procrustes means and Ziezold means (also denned in 
Section |3J of random elements taking values in E* C Sj^, are also contained 
within 

Remark 2.1. Suppose that [X] is a random shape on assuming shapes of 
configurations of dimension less than or equal to j (1 < j < m) with probability 
one, then every intrinsic, Procrustes and Ziezold mean shape of [X] corresponds 
to a configuration of dimension less than or equal to j . 

The main result of this paper in form Corollaries 14.41 and 14.91 applied to 
Kendall's spaces states that intrinsic and Ziezold means are also non decreasing 
in dimension. In view of Ziezold means, Remark 13.41 provides invariant optimal 
positioning and © on page [5] is valid on the pre-shape sphere. The technical 
definition of a cut locus on the quotient and a tubular neighborhood admitting 
a slice can be found in Section 14.11 

Theorem 2.2 (Intrinsic and Ziezold Kendall Means Preserve Dimension). Sup- 
pose that [X] is a random pre-shape on m > k with intrinsic or Ziezold 
mean shape [fi] G [i G S^. Moreover suppose that [X] either assumes dis- 
crete values outside the cut locus of \p] a.s. or that it is supported by a projection 
of a tubular neighborhood around fi admitting a slice. If [X] assumes shapes 
corresponding to non- degenerate configurations up to dimenson j (1 < j < m) 
with non-zero probability then fi corresponds to a non- degenerate j -dimensional 
configuration. 



3 Frechet yo-means and General Shape Spaces 



In the previous section we introduced Kendall's shape and reflection shape space 
based on invariance under similarity transformations; e.g. invariance under 
congruence transformations only leads to Kendall's size- and- shape space. More 
generally in image analys is, invariance may also be considered un d er the affine 



or pro jective group, cf. iMardia and Patrangenarul (|200l[ |2005l ): iMunk et a 
(2008). A different yet also very popular popular set of shape spaces for two- 



di mensional configura t ions m odulo the group of similarites has been introduced 
by IZahn and Roskies (1972). Instead of building on a finite dimensional Eu- 
clidean matrix space modeling landmarks, the basic ingredient of these spaces 
modeling closed planar unit speed curves is the infinite dimensional Hilbert 
space of Fourier series. In practice for numerical computation, only finitely 
many Fourier coefficients are considered. 

To start with, a shape space is a metric space (Q, d). For this entire paper 
suppose that X, X\, X2, ■ ■ ■ are i.i.d. random elements mapping from an abstract 
probability space (£l,A,P) to (Q,d) equipped with its self understood Borel 
cr-field. Moreover, denote by E(Y) the classical expected value of a random 
element Y on a D-dimensional Euclidean space R D , if existent. 
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Definition 3.1. For a continuous function p : Q x Q 
of population Frechet p-means by 



£M(X)=argmin MeQ E(p(X,/i) 2 ) 



For lj £ n denote by 



[0,oo) define the set 



E n ] M = argmin^Q ^ p( X j( u ), pf 
i=i 

the set of sample Frechet p- means. 

By continuity of p, the mean sets are closed random sets. For our purpose 
here, we rely on the definition of random closed sets as introduced and studied 



ne re, we rely on tn e definitio n 01 ra ndom closed sets as introduced and studied 
by IChoaued (|l954l ). iKendalll (Il974l) a nd iMatheronl (| 19751) . Since their original 



definition for p = d by Freched (| 19481) such means have found much interest 



Intrinsic me ans. Independently, for a Rie mannian manifold with geodesic 
distance d — p, IKobavashi and Nomizu (|l969l) defined the corresponding means 
as centers of gravity which are nowadays als o well known as intrinsic means 
(jBhattacharva and Patrangenaru 



Extrinsic means. W.r.t. the chordal or extrinsic metric d = p due to an 
embedding of a Riemannian manifold in an ambient Eu clidean space, such 
me ans have been called mean locations by iHendriksl (|l99lh or extrinsic means 
by iBhattacharva and Patrangenaru ( 2003 ). 

More precisely, let Q = M C R D be a Riemannian manifold embedded in 
a Euclidean space R D and let $ : R D — s- M denote the orthogonal projection, 
<5>{x) = infp g M \\x — p\\. For any Riemannian manifol d an embed ding that is 
even isometric can be found for D sufficiently large, see Nashl (1 19561 ) . Due to an 
extension of Sard's Theorem by Bhattacharva and Patrangenaru! (2003. p. 12) 
for a closed manifold, $ is uni-valent up to a set of Lebesgue measure zero. 
Then the set of extrinsic means is given by th e set of images & (E(Y)) where Y 
denot es X viewed as taking values in R 13 (cf. IBhattacharva and Patrangenaru 

j20oah ). 



Residual means. In this context, setting p(p,q) = \\d& q (p — q)\\ (p,q e M) 
with the derivative d$ 9 at q yielding the orthogonal projection to the embedded 
tangent space T q . 



T q M C T q R D , call the corresponding mean sets E^(X) 
and En P \uj), the sets of residual population means and residual sample means, 
respectively. For two-sph eres, p(p,q) has been studied under the name of crude 
residuals bv lJuppI {1988). In general, on unit-spheres 



p(p, q) = \\p - (p, = V 1 - (piQ) 2 = p(i,p) 



(2) 



is a quasi-metric (symmetric, vanishing on the diagonal p = q and satisfying the 
triangle inequality). 
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Due to curvature these concepts of means are essentially non-convex in the 
following sense. 



Remark 3.2. To a random point on a two-sphere, uniformly distributed along 
its equator, both north and south pole are intrinsic and extrinsic means. 

Residual means, however, seem less affected. 

Theorem 3.3. If a random point X on a unit sphere is a.s. contained in a 
unit subsphere S then S contains as well every residual mean of X . 

Proof. Suppose that x — v + v is a residual mean of X with v/||v|j G S and v 
normal to S. Since 1 - (X, v + vf = 1 - (X,v) 2 > 1 - (X,v) 2 /\\v\\ 2 a.s. with 
equality if and only if v = 0, the assertion follows at once from ©. □ 

Let us now incorporate more of the structure common to shape spaces: 
M is a complete connected D-dimensional Riemannian manifold with geodesic 
distance dM on which a Lie group G acts properly and isometrically from the 
left. Then we call the canonical quotient 

7T : M -> Q := M/G = [\p] : p G M} where [p] = {gp : g e G} , 

a shape space. As a consequence of the isometric action we have that dM^gp, q) = 
d(p,g~ 1 q) for all p,q £ M, g £ G. For p,q e M we say that p is in optimal 
position to q if c?m(p, q) = min se <3 d]\,i(gp, q), the minimum is att ained in conse- 
quence of the proper action. As is well known (e.g. Bredonl ( 1972 . p. 179)) there 



is an open and dense submanifold M* of M such that the canonical quotient 
Q* = M*/G restricted to M* carries a natural manifold structure also being 
open and dense in Q. Elements in M* and Q* , respectively, are called regular, 
the complementary elements are singular; Q* is the manifold part. 



Intrinsic means on shape spaces. The canonical distance 
d Q.([p\, M) : = mmd M (gp,q) = mm d M (gp,hq) 

g£& g,h£G 

is called intrinsic distance and the corresponding c?Q-Frechet mean sets are 
called intrinsic means. Note that the intrinsic distance is equal to the canonical 
geodesic distance on Q*. 



Ziezold and Procrustes means. Now, assume that we have an embedding 
with orthogonal projection $ : R — > M C M. D as above. If the action of G is 
isometric w.r.t. the extrinsic metric, i.e. if \\gp — gq\\ — \\p — q\\ for all p,q £ M 
and g € G then call 

( z) 

Pq (M>M) := ™n\\gp-q\\ 
p { $\\p]M) ■= mm\\d^ q {gp-q)\\ 
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the Ziezold distance and the Procrustes distance on Q, respectively. The corre- 
sponding Frechet mean sets are the Ziezold means and the Procrustes means, 
respectively. We say that optimal positioning is invariant if 

dM(g*P,q) = mm d M (gp,q) <^ \\g*p- <?ll = mm\\gp-q\\ 
for all p,q £ M and g* £ G. 



Rema rk 3.4. Indeed for Q = optimal positioning is invariant (cf. \Kendall et al 



199& p. 2 06)), Procrus tes means coincide with classical Procrustes means in 



tr oduced b y 



Goweh tl97&) and Ziezold means coincide with means as introduced 



bv \Ziezold \l99i ). Moreover for Q — S^, Procrus tes means agree with extrinsic 



means w.r.Hhe Veronese-Whitney embeddmg, cf. Ulhattarharua a,nd Pa,tra,n„enaA 
X200& ) and Section^ 



4 Stability Results for Intrinsic Means 

In this section we derive the stability results underlying Theorem 12.21 To this 
end we need to recall how a non-manifold shape space is made up from manifold 
strata of varying dimensions. 



4.1 Preliminaries 

As before, let M be a complete connected Riemannian manifold called the pre- 
shape space with intrinsic metric c£m on which a Lie group G acts properly 
and isometrically giving rise to the canonical quotient Q = M/G called the 
shape space with intrinsic metric dQ. The canonical projection is denoted by 
7T : M — > Q. T p M is the tangent space of M at p £ M and exp p denotes 
the Riemannian exponential at p. On a complete and connected manifold M 
we have for every p' £ M that there is v' £ T p M such that p' — exp p v'. 
v' — exp~ 1 p' of minimal modulus is uniquely determined as long asp £ M\C(p) 
with the cut locus C(p) of p which is a subset of measure zero in M. E.g. on a 
sphere, the cut locus of any point is its antipode. For q £ Q let 

C(q) := {[p'\ '■ p' £ C(p) is in optimal position to some p £ q] 

be the cut locus on the quotient of q £ Q. 

Next we recollect consequences of the Lie group action, see iBredonl (|l972h . 



The stability result in the sample case (below in Section |4~2|) rests on (A) and 
(B) alone. 

(A) With the isotropy group I p = {g £ G : gp = p} for p £ M, every orbit 
carries the natural structure of a coset space [p] = G/I p . Moreover, p' £ M 
is of orbit type (G/I p ) if I p ' = gl p g~ l = I gp for a suitable g £ G. If 
I p C I gp ' for suitable g £ G then p' is of lower orbit type than p. 
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(B) The pre-shapes of equal orbit type 

M {i P ) .- y e m : p' is of orbit type (G/I p )} 

and the corresponding shapes Q 1 * 1 ^ :— {[p'\ : p' £ M^ 1 ^} are manifolds in 
M and Q, respectively. 

(C) The orthogonal complement H p M in T p M of the tangent space T p [p] along 
the orbit is called the horizontal space: T p M — T p [p] © H p M . 

(D) The Slice Theorem (cf. iPalaid (Il960l) ) states that every p £ M has a 
tubular neighborhood [p] C U C M such that with a suitable open subset 
D C H p M the twisted product exp p Z) X/ p [p] is diffcomorphic with U. 
Here, the twisted product is the natural topological quotient of the product 
space exp p D x [p] modulo the equivalence 

(exp p v,gp) ~i p (exp p v' ,g'p) 3h £ I p such that v' = dhv, g' = ghT x . 

For short we say that the tubular neighborhood U admits a slice exp p D 
through p. 

(E) If a tubular neighborhood U of p admits a slice exp p D then every p' £ 
expp D is in optimal position to p. Moreover, all points p' £ U are of orbit 
type larger than or equal to the orbit type of p and only finitely many 
orbit types occur in U. If p is regular, i.e. of maximal orbit type, then 
the quotient is trivial: exp p D x j p [p] — exp p D x [p] . 

Finally, we link intrinsic to Ziezold means. The gradient of the mapping fi nt : 
T p M [0, oo) defined by = d M (p, exp p v) 2 is given by grad/^(-i;) = 2v. 

Hence, we have for pi,p2 £ M \ C(p) that 

grad/m^exPp 1 ^) = grad(/^)(exp p 1 p 2 ) <=> Pi = P2 

In view of Ziezold means let f^l : T p M —> [0, cx)) be defined by / e ^(w) = 
exp p v|| 2 /2. Mimicking the above property introduce the following condition 

grad(/ e ( ^)(exp p : 1 pi) = grad^^exp" 1 p 2 ) Pi = P2 (3) 

for pi,p2 £ M\ C(p). E.g. on spheres, © is valid. Using the residual distance 
instead, the analog to © is only true on half-spheres. 



4.2 Stability of Sample Means 

For short, for sampled points pi,...,p n £ M denote the set of intrinsic or extrin- 
sic sample means, respectively, by E(pi, . . . ,p n )', similarly for qi, . . . , q n £ Q de- 
note the set of intrinsic or Ziezold sample means, respectively, by E(qi, . . . , q n ). 
Obviously, both sets are non-void. 
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Lemma 4.1. In case of extrinsic and Ziezold means assume that optimal po- 
sitioning is invariant. Let q\ .. . ,q n £ Q and q £ E{qi, ■ ■ ■ ,q n )- If P G Q and 
Pl G qi, . . . ,p n G q n are in optimal position to p then p £ E(pi, . . . , p ra ). 

Proof. The proof in case of extrinsic and Ziezold means is completely analogous 
to the following case of intrinsic means. If the assertion would not be true, we 
would have with p £ E(pi, . . . ,p n ) that 

n n 

^dQdpl^jf = ^2d M (p,gjPjf 
.7=1 i=i 

n n 

i=i i=i 

for gipi , . . . , g n Pn in optimal position to p, a contradiction to the hypothesis 
q £ E(q 1 ,...,q n ) □ 



The following necessary condition is taken from (jKobavashi and Nomizul . 
19691 p. 110). 



Lemma 4.2. In case of intrinsic means let p £ M and p%, . . . ,p n £ M \ C(p) 
with p £ E(pi, . . . ,p n ) for a complete and connected manifold M . Then 

n 

^exp-Vj =0. 

3=1 

For extrinsic means the analog condition is 

j^f'StiP^Pj) - 0. (4) 

3=1 

Theorem 4.3. Suppose that G is a Lie group acting properly and isometri- 
cally on a complete connected Riemannian manifold M giving rise to the nat- 
ural quotient Q = M/G. If q £ E(q\, . . . ,q n ) is an intrinsic sample mean of 
qi,...,q n G Q \ C(q) then 

with arbitrary p £ q and px G qi, . . . ,p n G q n in optimal position to p. In case 
of invariant optimal positioning and under condition {3J) the same assertion is 
true ifq is a Ziezold sample mean. 

Proof. The proof in case of Ziezold means is completely analogous to the fol- 
lowing case of intrinsic means, using Q instead of Lemma T4.2I If the assertion 
would be false, we may assume that 7p (f_ I pi , i.e. that there is g £ G with 
gp = p but gp\ 7^ pi. In consequence of Lemma 14.11 p is an intrinsic mean of 
both pi, . . . ,p n and gpi,p2, . . . ,p n , both being in optimal position to p. Since 



9 



by hypothesis, exp- 1 (gpi) ^ exp- 1 ^) is well defined, we have in consequence 
of Lemma 14.21 the contradiction 

n 

n 

= exp^ 1 9V\ ~ exp^ 1 pi + ^ exp F ^ . 

3=1 

□ 

Corollary 4.4. Suppose that G is a Lie group acting properly and isometrically 
on a complete connected Riemannian manifold M giving rise to the natural 
quotient Q = M/G. Then every intrinsic mean q £ Q of a sample qi, . . . , q n £ 
Q \ C(q) is regular if the sample contains at least one regular point. In case 
of invariant optimal positioning and under condition f3J) the same assertion is 
true for Ziezold sample means. 

Proof. The assertion is a consequence of Theorem 14.31 and the well known fact 
that regular points p* € M* are characterized by the property that for every 
p E M there is a g — g p £ G such that glp-g^ 1 C I p : if additionally p is in 
optimal position to p* then g G I p . □ 

4.3 Stability of Population Means 

Theorem 4.5. Assume that X is a random element on the canonical quotient 
Q = M/G due to the isometric and proper action of a Lie group G on a complete 
Riemannian manifold M . Let p £ M and suppose that X is supported on ir(U) 
with a tubular neigborhood U € M of [p] that admits a slice exp p D x j p [p] = U, 
D e H p M. IfV{X £ Q ( V)} ^ for some p' £ U and if either [p] 

(i) is an intrinsic mean of X or 

(ii) is a Ziezold mean of X while optimal positioning is invariant and ^ is 
valid, 

then p' is of lower orbit type than p. 

The proof of Theorem 14.51 further below relies on the two following lemmas 
for which we first develop an additional concept. 

In the following, measurable will refer to the corresponding Borel cr-algebras, 
respectively. 

Definition 4.6. Call a measurable subset L C M a measurable horizontal lift 
of a subset R of M/G in optimal position to p £ M if 

1. the canonical projection L — > R C M/G surjective, 

2. every p' £ L is in optimal position to p. 
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3. every orbit [p'\ of p' £ L meets L once. 

Lemma 4.7. Let U C M be a tubular neighborhood about p £ M that admits a 
slice exp p D x j [p] = U . Then, there is a measurable horizontal lift L C exp p D 
ofn(U). 

Proof. If p is regular, then L = exp p D has the desired properties, cf. (E) 
above. Now assume that p is not of maximal orbit type. W.l.o.g. assume that 
D contains the closed ball B of radius r > with bounding sphere S = dB and 
that there are p±, . . . ,p,j € exp p (S') having the distinct orbit types orccuring in 
S. Si denotes all points on S of orbit type (G/I Pj ), j = 1, . . ., J, respectively. 
Observe that each S J is a manifold on which I p acts properly and isometrically. 
Hence for every 1 < j < J, there is a finite (Kj < oo) or countable (Kj = oo) 
sequence of tubular neighborhoods U° k C covering , admitting trivial slices 

expj D{ x {//,-; :geI pJ } = Ul, l<k< l\ , . 

Pk r k 

Here, exp 5 ^ denotes the Riemann exponential of . Defining a disjoint sequence 

Pk 

U{ : U{ 
Ul +1 := ^ +1 \^forl<fc<i^-l 

exhausting S J we obtain a corresponding sequence of disjoint measurable sets 
D{ with 

expg Di x {gp{ ijeJ^S^, 1 < k < K, . 

Setting 

Li : cx P ; Di and V := [j L{ 

k—l 

observe that every p' G S 3 has a unique lift in L 3 which is contained in a unique 
L 3 k . This lift is by construction in optimal position to p. If gp' € L 3 k , for some 
g E G and 1 < fc' < ifj we have by the disjoint construction of U 3 k and U k , 
that k = k' , hence the isotropy groups of gp' and p' agree, yielding gp' — p' . In 
consequence, L 3 is a measurable horizontal lift of S 3 in optimal position to p. 
Since every horizontal geodesic segment t i— > exp p (iw), w € H p M contained in 
exp p D features a constant isotropy group, except possibly for the initial point 
we obtain with the definition of 

L := {exp p (iw) £ exp p D : v £ exp~ 1 (L : ' ) for some 1 < j < J, t > 0} 

a measurable horizontal lift of ir(U) in optimal position to p. □ 

The following Lemma 14.81 is the analog in the population case of Lemmas 
14.11 and 14.21 The proof goes similar. 
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Lemma 4.8. Assume that X is a random element on the canonical quotient 
Q = AI/G due to the isometric and proper action of a Lie group G on a complete 
Riemannian manifold M . Moreover, assume that there is p £ M with tubular 
neigborhood U £ M of [p] that admits a slice exp p D x j p [p] = U such that X 
is supported by tt(U). With a measurable horizontal lift L of ir(U) define the 
random element YonL<zMbyiroY = X. If [p] is an intrinsic mean of X 
on Q, then p is an intrinsic mean of Y on M and 

E(exp- 1 F) =0. 

If [p] is a Ziezold mean of X on Q and optimal positioning is invariant, then p 
is an extrinsic mean ofY on M and 

E(grad(/ e;c4 )(exp; 1 r)) =0. 

Proof of Theorem \4-5\ The proof in case of Ziezold means is completely analo- 
gous to the following case of intrinsic means. With the hypotheses of Theorem 
14.51 suppose that L is a horizontal measurable lift of will) guaranteed by Lemma 
14.71 through an intrinsic mean p € M of the random element Y on M defined 
as in Lemma T4.8I with [p] £ E dQ (X). With the notation of Lemma 14.71 and its 
proof, if the assertion of the Theorem would be false, w.l.o.g. there would be a 
qj £ S J with gpj ^ pj for some g € I p and P{Y £ AP} > for some 1 < j < J 
with 

AP := {exp p (to) £ exp p (L») : v £ exp" 1 ^'),* > 0} 

In particular, w.l.o.g. in the proof Lemma 14.71 we may choose an arbitrarily 
small Lj around pj such that 

f (exp^Y-exp-HgY^dPy ? 0. 

with All := { ex P P (t v ) e ex Pj9(-0) : v e ex Pp 1 (^fc)'^ — 0}- Suppose that L 
is obtained as in the proof of Lemma 14.71 by using Lj, and suppose that L' 
is obtained from L by replacing L J k with gLj,. Then L' is also a measurable 
horizontal lift in optimal position to p. Moreover with the lifts Y' of X to L' 
and Y of X to L guaranteed by Lemma [4T51 we have that 

0=[ exp- l YdPy- / exp^r'dP^ 

= f (exp^Y-exp^idgY^dPy £ 0, 

a contradiction to the above, completing the proof. □ 
We have at once the population analog to Corollary 14.41 

Corollary 4.9. Suppose that a random pre-shape Y is distributed on M abso- 
lutely continuous w.r.t. Riemmanian measure and supported by a tubular neigh- 
borhood that admits a slice through p £ AI . Let X = n o Y be the corresponding 
random shape. Then p is regular if [p] is an intrinsic mean of X , or if \p] is a 
Ziezold mean, optimal positioning is invariant and is valid. 
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5 Extrinsic Means for Kendall's Shape Spaces 



Let us recall the well known Veronese-Whitney embedding for Kendall's planar 
shape spaces £*. Identify F k with C fe_1 \ {0} such that every landmark column 
corresponds to a complex number. This means in particular that z 6 C fc_1 
is a complex row(!)-vector. With the Hermitian conjugate a* — (a]tj) of a 
complex matrix a = (djk) the pre-shape sphere S/f i s identified with {z £ 
C k-i . zz * = X } on which so{ 2) identified with S 1 = {A e C : |A| = 1} acts by 
complex scalar multiplication. Then the well known Hopf-Fibration mapping to 
complex projective space gives — S^/S 1 = CP k ~ 2 . Moreover, denoting with 
M(k — 1, k — 1, C) all complex (k — 1) x (k — 1) matrices, the Veronese- Whitney 
embedding is given by 

S^/S 1 -> {a e M(k - 1, k - 1, C) : a* = a} 
[z] 1 — ^ 

Remark 5.1. T/ie Procrusets metric of is isometric with the Euclidean 
metric of M(k — 1, k — 1, C) since we /icroe (z, u>) = Re(zt<;*) for z, w £ S k and 
hence, d^([z], [w]) — \fl — wz*zw* — \\w*w — z*z\\/y2. 

The idea of the Veronse- Whitney embedding can be transferred to the gen- 
eral case of shapes of arbitrary dimension m > 2. Even though the embedding 
given below is apt only for reflection shape space it can be applied to prac- 
tical situations in similarity shape analysis whenever the geometrical objects 
considered have a common orientation. As above, the number k of landmarks 
is essential and will be considered fixed throughout this section; the dimension 
1 < m < k, however, is lost in the embedding and needs to be retrieved by pro- 
jection. To this end recall the (non-unique) embedding of Sj in S k n (1 < j < rn) 
in (P) which gives rise to a unique and canonical embedding of i?E™ in i?E^. 
Moreover, consider the strata 

(RE^Y := {[x] e i?St : rank(z) = j}, (X k m ¥ := {[x] e E k m : rank(x) = 3} 

for j = 1, . . . , m, each of which carries a canonical manifold structure; due to 
the above embedding, (RY, k n y can and will be identified with (RS k y such that 

m 

Rn k m = [j(R^y. 

At this point we note that SO(m) is connected, while 0(m) is not; and the 
consequences for the respective manifold parts, i.e. points of maximal orbit 
type: 

fX^k \# fS~*fa \ m — 1 1 1 f \ m 
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Similarly, we have a stratifiction 

fc-i 

V := {a e M(k- l,k- 1) : a = a T > 0,tr(a) = l} = (J 7" 

3=1 

of a bounded flat convex manifold V with non-flat unbounded manifolds 

V j :={aeV : rank(a) = j} (j = 1, . . . , k - 1) , 

all embedded in M(fc — 1, fc — 1). The Schoenberg map s : iffi*, -> "P is then 
defined on each stratum by 

[x] i-> x T a; 

For x £ Sj recall the tangent space decomposition T x Sj — T x [x] © H x Sj into 
the vertical tangent space along the fiber [x] and its orthogonal c omplement the 
horizontal tangent space. For x E (Sj)-* identify canonically (cf. iKendall et al 
(|l999l p. 109)): 



T lx] (HEW = H X S* = {wE M(j, k-l): ti{wx T ) = 0, wx T = xw T } . 



Then the assert ion of the following T heorem condenses results of lBandulasiri and Patrangenaru 
(|2005l) . cf. also lDrvden et all (|2008h . 



Theorem 5.2. Each s- 7 is a diffeomorphism with inverse (s^)~ l (a) = |(\/Au T ){] 
where a = uAu T im'i/i u G 0(k— 1), A = diag(Ai, . . . , A m ), and = Xj+i = . . . = 
Afc_i in case of j < k — 1. Here, (a){ denotes the matrix obtained from taking 
only the first j rows from a. For x E Sj and w E H x Sj = T^(KSj) J the 
derivative is given by 

d(s-' ) r x ] w — x T w + w T x . 

Remark 5.3. In contrast to the Vernose- Whitney embedding, the Schoenberg 
embedding is not isometric as the example of 

cos 6 \ / sin^> \ / cose 



siiu. j Wl \ -cos0 j ' W2 \ siiw. 

ieac/ies: 

||x T wi + a; || = v22| cos ^ sin 0| , ||a; T W2 + uiJ.t|| = v2. 

Since T 3 is bounded, convex and Euclidean, the classical expectation E(X T ) G 
■p- 7 for some 1 < j < fc — 1 of the Schoenberg image X T X of an arbitrary random 
reflection shape [X] € is well defined. Then we have at once the following 

relation between rank of Euclidean mean and increasing sample size. 
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Figure 1: Projections (if 
existent) of two points 
(crosses) in the X-plane 
to the open line segment 
A = {(Ai,A 2 ) : Ai + A 2 = 
l,Ai,Aa > 0}. The dotted 
line gives the central pro- 
jections ( den oted by stars) 
sugges ted by \Dryden et all 
(2008 ) which is well defined 
for all symmetric, positive 
definite matrices (corresponding to the first open guadrant), the dashed line 
gives the orthogonal projection ( circle ) which is well defined in the triangle 
below A ( corresponding to V) and above A in an open strip. In particular it 
exists not for the right point. 




Theorem 5.4. Suppose that a random reflection shape [X] G -RS^ is distribed 
absolutely continous w.r.t. the projection of the spherical measure on S!^. Then 

1 - 

E(X T X) e V^ 1 and - V Xj X { € V 3 a.s. 
n ^-^ 

i=l 

for every i.i.d. sample X±, . . . , X n ~ X if j < nm < j + 1 for nm < k — 1 and 
j = k — 1 for nm > k — 1 . 

Hence, in stastical settings involving a higher number of landmarks, a suffi- 
ciently well behaved projection of a high rank Euclidean mean onto lower rank 
V r = (£y) r , usually m — r, is to be employed, giving at once a mean shape 
satisfying strong consistency and a CLT. Here, unlike to intrinsic or Procrustes 
analysis, the dimension r chosen is crucial for the dimensionality of the mean 
obtained. 

The orthogonal projection 

(f> r : UtTr V* -> V 

a i — y argmin fce pr tr ((a — 6) 2 ) 

giving the set of extrinsic Schoenberg means has been computed bv lBhattacharva 
(|2008li : 

Theorem 5.5. For 1 < r < k - 1, a = u\u T € V with u E 0{k - 1), A = 
diag(Ai, . . . , A m ), Ai > . . . > Afe_i and A r > the orthogonal projection onto 
V r is given by 

4> r {a) = u\xu T 
with /i = diag(/ii, . . . , jj, r , 0, . . . , 0), 

Hi = A, : + - - X r (i = 1, . . . ,r) 
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and \ r = - Y^—i Ai < i which is uniquely determined if and only if X r > \ r +i ■ 

With the notation of Theorem l5.51 a non-orthogonal central pr ojection ^ r (a) 
uvu T equally well and uniquely determined has been proposed bv lDrvden et al. 
(|2008h with v = diag(i/i,...,i/ r ,0,...,0), 

v% = -=- (i = l,...,r). 

Orthogonal and central projection are depicted in Figure O 



6 Local Effects of Curvature 

In this section we assume that a manifold stratum M supporting a random 
element X is isometrically embedded in a Euclidean space M. D of dimension 
D > 0. With the orthogonal projection $ : IR 15 — > M from Section [3] and the 
Riemann exponential exp p of M at p we have the 

intrinsic tangent space coordinate expp 1 ^ and the 
residual tangent space coordinate d<& p (X — p) , 

respectively, of X at p, if existent. 



6.1 Finite Power of Tests and Tangent Space Coordinates 

With the above setup, assume that /i £ M is a unique mean of X. If c £ M. D 
is the center of the osculatory circle touching the geodesic segment in M from 
X to /i at /j, with radius r, and if X r is the orthogonal projection of X to that 
circle, then X = X r + 0(\\X — (J>\\ 3 ). Moreover, with 

/ X-c u-c\ 

C08a = \W=4'—/ 

= ^(X r -c, f i-c)+0(\\X-ri\ 3 ) 
we have the residual tangent space coordinate 

v = X — c — — I IX — ell cos a = X r — c — (fj, — c) cos a + 0(\\X — /dl 3 ) 

r 

having squared length ||w|| 2 = r 2 sina 2 + 0(\\X — /i|| 3 ). By isometry of the 
embedding, the intrinsic tangent space coordinate is given by 

VOL o 

exp M X=— v + 0(\\X ~ vf) ■ 

With the component 

v = fi- c-\\X - c||- — - cos a = (fj, - c)(l - cos a) + 0(\\X - fi\\ 3 ) 
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of X normal to the mentioned geodesic segment of squared length \\v\\ 2 = r 2 (l — 
cos a) 2 + 0(\\X - /^|| 3 ) we obtain 

||exp M X|| 2 = || u ||V|M| 2 +0(||X-/i|| 8 ), 

since 

a 4 

(1 - cos a) 2 + sin 2 a = 2(1 -cos a) = a 2 + 2 — H 

and a = 0(\\X — fi\\). In consequence we have 

Remark 6.1. In approximation, the variation of intrinsic tangent space coor- 
dinates is the sum of the variation \\v\\ 2 of residual tangent space coordinates 
and the variation normal to it. In particular, for spheres 

Hexp^H 2 > H 2 +H 2 . 

Since the variation in normal space is irrelevant for a two-sample test for equal- 
ity of means, say, a higher power for tests based on intrinsic means can be 
expected when solely residual tangent space coordinates obtained from an iso- 
metric embedding are used rather than intrinsic tangent space coordinates. 

Note that the natural tangent space coordinates for Ziezold means are resid- 
ual. 

A simulated classification example in Section [7] illustrates this slight effect. 



6.2 The 1:3- Property for Spherical and Kendall Shape 
Means 

In this section M = S ^ 1 C M. D is the (D — l)-dimensional unit-hypersphere 
embedded isometrically in Euclidean Z?-dimensional space. The orthogonal pro- 
jection $ : R D —> S ^ 1 : p — > ^ is well defined except for the origin p = 0, 

and the normal space at p £ S ^ 1 is spanned by p itself. In consequence, a 
random point X on S ^ 1 has 

ct 

d$JX —p) = X -pcosa, expJX) = d§ p (X - p) 

' sin a 

as residual and intrinsic, resp., tangent space coordinate at —X / p £ S ^ 1 
where cos a = (X,p). 

Remark 6.2. An adaption (i.e. multiplying by the respective ^^-^ in every 
iterati on step) of the algorithm for intrinsic means in Huckemann and Ziezold 



\200a . Section 5.4) gives at once an algorithm converging usually quickly to one 
of the two sample residual means. 

Theorem 6.3. If X is contained in a closed half sphere with non-zero probability 
in its interior, it has a unique intrinsic mean which is assumed in the interior 
of that half sphere. 
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Proof. Below, we show that e very intrinsic m ean necessarily lies within the 
interior of the half sphere. Then, iKendalll (|l990l Theorem 7.3) yields uniqueness. 



W.l.o.g. let X = (sin0,x 2 , . . . ,x n ) such that V{si\i4> < 0} = < 7>{sin0 > 0} 
and assume that p = (sin-0,P2, ■ ■ ■ ,Pn) G S ^ 1 is an intrinsic mean. Moreover 
let p' = (sin(|^|),p2) ■ • ■ ,Pn)- Since 

IE (|| exp p (X)|| 2 ) = E (arccos 2 (p,X)) 



= E I arccos 2 I sin ip sin <f> + pjXj 

V V ^= 2 

< E(||ex Pp ,pO|| 2 ) 



with equality if and only if = if), this can only happen for ip > 0. Now, 
suppose that p = (0,j>2, ■ • ■ ,Pn) is an intrinsic mean. For deterministic ip > 
consider p(ip) — (sin ip,p± cosip, . . . ,p n cosip). Then 



E 



^|| exp p (,0)(X)|| 2 ^ = E | arccos 2 | sin ip sine/) + cos </>y^p-,-x 



j 

3=2 



= E(||fflq) p (X)|| a )-Ci^ + 0(^) 

with C\ > since "P{sin</> > 0} > 0. In consequence, p cannot be an intrinsic 
mean. Hence, we have shown that every intrinsic mean is contained in the 
interior of the half sphere. □ 

Remark 6.4. For the special case of spheres, this extends a result of H. Le, 
asserting a unique intrinsic mean on general Riemannian manifolds, if among 
other s, the support of X is contained in a geodesic qu arter-ball, see\L^ l[20oI . 
200^) - H, Le's result is an extension o AKarcheA \l9li) . asserting uniqueness of 



intrinsic means in geodesic balls if among others, X is supported in a geodesic 
quarter ball. Intrinsic means restricted t o geodesic balls are often called Karcher 
means. As mentioned in the above vroof. [Kendall 1 199(\ , Theorem 7.3) extended 



Marcher's result to obtain uniquenes of Karcher means, if among others, the 
support of X is contained in a half ball. In view of Theorem \6.3\ one might expect 
to extend Le's result on uniquenes of intrinsic means on the entire manifold 
accordingly, to random elements contained in a geodesic half ball. 

The following Theorem characterizes the three spherical means. 

Theorem 6.5. Let X be a random point on S D ~ X . Then x^ G S ^ 1 is the 
unique extrinsic mean if and only if the Euclidean mean E(X) = J s d-i X dPx is 
non-zero. If the following right hand sides are non-zero, then there are suitable 
\^ = \\E(X)\\ > 0, AM > and A« > such that 

A (e) a; (e) = E(X) , 

every residual mean x^ G S ^ 1 satisfies 

A (r V r) = E((X : x^)X) , 
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and every intrinsic mean x^ £ S D 1 satisfies 



/n fte 2as< case we additionally require that E 



( ar j ccos(x ' a:< " ) (X.xW)^ > which 



is in particular the case if X is contained in a closed half sphere with non-zero 
probability in the interior. 



Proof. The assertions for the extrinsic mean are well known from lHendriks et al 



1996). The second assertion for residual means follows from minimization of 

/ \\p-(p,x)x\\ 2 dP x (p) = l- f (p,x) 2 dP x ( P ) 

with respect to x £ M. D under the constraining condition \\x\\ = 1. Using a 
Lagrange ansatz this leads to the necessary condition 

(p, x) p dP x (p) = \x 

gD-1 

with a Lagrange multiplier A of value E((X,x) 2 ) which is positive unless X is 
supported by the hypersphere orthogonal to x. In that case, by Theorem l3.3l x 
cannot be a residual mean of X, as every residual mean is as well contained in 
that hypersphere. Hence, we have A > 0. The same method applied to 

exp" 1 (p) 1 1 2 dP x (p) = / arccos 2 ((p, x) ) dP x (p) 
Js - 1 

taking into account Theorem 16.31 yields the third assertion on the intrisic mean. 

□ 

In particular, residual means are eigenvectors to the largest eigenvalue of the 
matrix E(XX T ). As such, they rather reflect the mode then the classical mean 
of a distribution: 

Example 6.6. Consider 7 € (0,7r) and a random variable X on the unit circle 
{e 10 : 9 £ [0, 2ir)} which takes the value 1 with probability 2/3 and e n with prob- 
ability 1/3. Then, explicit computation gives the unique intrinsic and extrinsic 
mean as well as the two residual means 

x (i) — gij x (e) _ e »arctan ^'"J^ ^ ^(r) _ j_ £ »| arctan _ 

Figure [H shows the case 7 = ^ . 

In contrast to Figure [2 one may assume in many practical applications that 
the mutual proximities of the unique intrinsic mean the unique extrinsic 
mean x^ and the unique residual mean x^ r °^ closer to x^ are rather small, 
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Figure 2: Means on a circle of a distri- 
bution taking the upper dotted value with 
probability 1/3 and the lower right dot- 
ted value with probability 2/3. The lat- 
ter happens to be one of the two residual 
means. 



namely of the same order as the squared proximity of the modulus ||E(X)|| of 
the Euclidean mean to 1 , cf . Table [1] We will use the following condition 



\ x {e) _ x {ro)\\ \\ x {e)_ x {i)\ 



0((1-||E(X)||) 2 ) (5) 



with the concentration parameter 1 — | E(X)||. 

Corollary 6.7. Under condition {5j), if all three means are unique, then the 
great circular segment between the residual x^ ^ mean closer to the extrinsic 
mean x^ and the intrinsic mean x^ 1 ' is divided by the extrinsic mean in ap- 
proximation by the ratio 1:3: 

X - AM I ||E(X)|| +U[[i 



{l) ||E(X)|| ( (e) ! E({X-xU,X)X) 

AW y 3 ||E(X)|| 

with A^ and A^ 1 * - 1 from Theorem \6.5\ 



0((1 




Proof. For any x,p G S D 1 decompose p — x = p — (x,p) x — z(x,p)x with 
z(x,p) = 1 — (x,p), the length of the part of p normal to the tangent space at 
x. Note that E [z{x^ e \X) = 1 - || E(X)||. Now, under condition ©, verify the 
first assertion using Theorem 16.51 



x 



>°) = j L j {e(X)-E(z(x^\X)X)) 



On the other hand since 



arccosfl — z) 1 2 9 

= 1 + -Z + — z 2 + 



y/l - (1-z) 2 3 15 
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we obtain with the same argument the second assertion 

x(i) = 1 f arcco S (X,xffl) \ 
y/l-(X,x^) 2 I 




L (l(X) + 1 E ,X)X)|^E , X) 2 x) + . . .) 

™ + E X) x) + ((l - || E(X)II)*)) 



□ 

Remark 6.8. TTie tangent vector defining the great circle approximately con- 
necting the three means is obtained from correcting with the expected normal 
component of any of the means. As numerical experiments show, this great 
circle is different from t he fir st principal component geodesic as defined in 



Huckemann and Ziezold] t200d) . 



Recall the following connection between intrinsic and extrinsic spherical ver- 
sus intrinsic and Ziezold shape means, respectively, cf. Lemmas 14.11 and 14.81 It 
is easy to see the corresponding connection for residual versus Procrustes means. 

Remark 6.9. Suppose that a random shape X on is supported by R C Sj, 
admitting a horizontal measurable lift L C in optimal position to p 6 . 
Define the random variable Y on by it o Y = X . Then we have that 

if [p] is an intrinsic mean of X then p is an intrinsic mean of Y , 

if [p] is a Procrustes mean of X then p is a residual mean of Y , 

if [p] is a Ziezold mean of X then p is an extrinsic mean of Y 

In consequence, Corollary 16 . 71 extends at once to Kendall's shape spaces. 

Corollary 6.10. Suppose that a random shape X on with unique intrinsic 
mean fi^\ unique Ziezold mean fi^ and unique Procrustes mean // Po ' closer to 
fjM^ is supported by R C E^, admitting a horizontal measurable lift L C in 
optimal position to ijS- z ' € S^. If the means are sufficiently close to each other 
in the sense of 

d E *>w -/>>)), d & j^-^) = o((i-||E(r)||) 2 ) 

with the random pre-shape Y on L defined by X = it o Y , then the generalized 
geodesic segment between //W and fi^ ^ is approximately divided by by the 
ratio 1 : 3 with an error of order 0((1 — || E(Y)||) 2 ). 
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7 Examples: Exemplary Datasets and Simula- 
tions 



7.1 The 1 : 3 property 

In the first example we illustrate Corollary 16. 101 on the basis of four data sets: 

poplar leaves: contains 104 quadrangular planar shapes extracted from poplar 
leaves in a joint collaboration with Institute for Forest Biomet r y and In- 
formatics at the University of Gdttingen, cf. iHuckemann et al. I d2009h . 

digits '3': contains 30 planar shapes with 13 landmarks e ach, extracted from 
handwritten digits '3', cf. iDrvden and Mardia (1998, p. 318). 

macaque skulls: contains three-dimensional sh a pes w ith 7 landmarks each, of 
18 macaque skulls, cf. IDrvden and Mardial (|1998l p. 16). 

iron age brooches: contains 28 t hree-dimensional tetrahedral shapes of iron 
age brooches, cf. ISmalll (|1996l Section 3.5). 



means of 104 poplar leaf shapes 



0.00025" 



-0.00015 



-0.00005 



means of 30 digits '3' 



-0.008 



-0.004 



"O7U00 



means of 18 macaque skulls 




means of 28 Iron age brooches 



8e-05 -6e-05 -4e-05 -2e-05 OeTOO 



-0.0020 



-0.0010 



"0UOO0 



Figure 3: Depicting shape means for four typical data sets: intrinsic (star), 
Ziezold (circle) and Procrustes (diamond) projected to the tangent space at the 
intrinsic mean. The cross divides the generalized geodesic segment joining the 
intrinsic with the Procrustes mean by the ratio 1:3. 
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data set 








(i-|E(y)|) 2 


poplar leaves 


6.05e - 05 


1.83e - 04 


2.44e - 04 


5.24e-05 


digits '3' 


0.00154 


0.00452 


0.00605 


0.00155 


macaque skulls 


1.96e-05 


5.89e - 05 


7.85e - 05 


7.59e - 06 


iron age brooches 


0.000578 


0.001713 


0.002291 


0.000217 



Table 1: Mutual shape distances between intrinsic mean fx^', Ziezold mean jjS z ^ 
and Procrustes mean closer to the Ziezold mean for various data sets. 

Right column: the squared distance between modulus of Euclidean mean and 1, 
cf. Corollary \6.1(A 

As clearly visible from Figure [3] and Table [TJ the approximation of Corollary 
16.101 for two- and three-dimensional shapes is highly accurate for data of little 
dispersion (the macaque skull data) and still fairly accurate for highly dispersed 
data (the digits '3' data). 



Q 



intrinsic 
mean 



Ziezold 
mean 



Procrustes 
mean 



Figure 4: A data set of three planar triangles (top row) with its corresponding 
intrinsic mean (bottom left), Ziezold mean (bottom center) and Procrustes mean 
(bottom right). 
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q1 q2 q 




intrinsic intrinsic 
mean(q1,q2) mean (q1,q2,q) 




Schoenberg Schoenberg 
mean(q1,q2) mean (q1,q2,q) 




Figure 5: Planar triangles q\ — x cos f3 — u>2 sin j3, 92 = x cos /3 — W2 sin /3 and 
q = xcos/3' + wi sin/3' with x,Wi,W2 from Remark \5.3\. (f> = 0.05, ft = 0.3 = 
(top row). Intrinsic means (middle row) of sample (91,^2) (left) and (91,92,9) 
(right). Schoenberg means (bottom row) of sample (91,92) (left) and (91,92,9) 
(right). 

7.2 "Blindness" of Procrustes and Schoenberg Means 

In the second example we illustrate an effect of "blindness to data" of Procrustes 
means and Schoenberg means. The former blindness is due to the affinity of the 
Procrustes mean to the mode in conjunction with curvature, the latter is due 
to non-isometry of the Schoenberg embedding. While the former effect occurs 
only for some highly dispersed data when the analog of condition ([5]) is violated, 
the latter effect is local in nature and may occur for concentrated data as well. 

Example 7.1. Reenacting the situation of Examvle \6. 6\ and Figure^ the shapes 
of the triangles P and Q in Figure [^] are almost maximally remote. Since the 
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mode P is assumed twice and Q only once, the Procrustes mean is nearly blind 
to Q. In case of 



the Procrustes mean shape of the sample P, P, Q is the one- dimensional P be- 
cause of blindness to the two-dimensional Q. 

Let us record this fact. 

Remark 7.2. The Procrustes mean of a random shape assuming both regular 
and degenerate shapes with non-zero probability may be degenerate. 

Even though Schoenberg means have been introduced to tackle 3D shapes, 
the effect of "blindness" can be well illustrated already for 2D. To this end con- 
sider x = x(cj>), wi — wi(4>) and W2 = W2{4>) as introduced in Remark l5.3l Along 
the horizontal geodesic through x with initial velocity W2 we pick two points 
qi = x cos j3 + W2 sin (3 and q2 — x cos j3 — u>2 sin (3. On the orthogonal horizontal 
geodesic through x with initial velocity w\ pick q = xcosfl' + w\ sin/3'. Recall 
from Remark 1 5. 31 that along that geodesic the derivative of the Schoenberg em- 
bedding can be made arbitrarily small for (f> near 0. Indeed, Figure [5] illustrates 
that for small 4>, the Schoenberg mean is nearly unchanged if the triangle q is 
added to the sample qi,q2- The corresponding intrinsic means, however, are 
sensitive to the sample's expansions. 



7.3 Classification Power of a Test 

In the ultimate example we illustrate the consequences of the choice of tangent 
space coordinates and the effect of the tendency of the Schoenberg mean to in- 
crease dimension by a classification simulation. To this end we apply a Hotclling 
T 2 -test to discriminate the shapes of 10 noisy samples of regular unit cubes from 
the shapes of 10 noisy samples of pyramids with top section chopped off, each 





Figure 6: cube (left) and pyramid (right) for classification. 
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with 8 landmarks, given by the following configuration matrix 

/Of i±£ 1=1 o i 
1^ l=z 1 1 
\ e e 

(cf. Figured]) determined by e > 0. In the simulation, indenpendent Gaussian 
noise with variance a 2 > is added to each landmark measurement. Table [2] 
gives the number of correct classifications for 1,000 simulations for given a, e 
and significance level. 



cr 


c 


level 


intrinsic mean with 


Ziczold 


Procrustes 


Schoenberg 








residual 


intrinsic 


mean 


mean 


mean 








tangent 


space coordinates 








0.35 


0.05 


0.1 


522 


542 


538 


531 


442 


0.3 


0.1 


0.1 


558 


583 


581 


575 


483 


0.2 


0.2 


0.05 


556 


575 


574 


575 


512 


0.1 


0.3 


0.01 


359 


369 


369 


368 


366 


0.05 


0.35 


0.01 


764 


780 


780 


780 


809 



Table 2: Number of correct classifications within 1,000 simulations each of 10 
unit-cubes and pyramids determined by e (which gives the height), where each 
landmark is independently corrupted by Gaussian noise of variance a 2 , via a 
Hotelling T 2 -test for equality of means to the given significance level. 



As visible from Table [2j discriminating nearly flat pyramids (e = 0.05) from 
cubes is achieved much better by employing intrinsic, Ziezold or Procrustes 
means rather than Schoenberg means. This finding is in concord with Theo- 
rem [5T] samples of size 10 of three-dimensional configurations yield Euclidean 
means a.s. in V 7 which are projected to "P 3 to obtain Schoenberg means in 
E|. In consequence, Schoenberg means of nearly two-dimensional pyramids are 
essentially three dimensional. With increased height of the pyramid (e = 0.35), 
i.e. for more pronounced third dimension and increased proximity to the unit 
cube), all means perform almost equally well, with a tendency of the Schoen- 
berg mean to eventually outperform the others. Moreover in any case, intrinsic 
means with intrinsic tangent space coordinates behave much poorer in view of 
shape discrimination than intrinsic means with residual tangent space coordi- 
nates, cf. Remark 16.11 The latter (intrinsic means with residual tangent space 
coordinates) are better or equally good as Ziezold and Procrustes means (which 
naturally use residual tangent space coordinates). 



intrinsic mean 


Ziezold mean 


Procrustes mean 


Schoenberg mean 


235.167 


182.335 


225.982 


35.595 



Table 3: Average time in seconds for computation of 1,000 means of sample 
size 20. 



In conclusion we record the time for mean computation in Table [3] While 
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Ziezold means compute in approximately 3/4 of the computational time for 
intrinsic means, Schoenberg means are obtained approximately 5 times faster. 

Note that we have included inference based on Procrustes means for illus- 
tration, even though no stability result is available and hence at this point, it 
cannot be theoretically justified that the CLT holds for Procrustes means as 
well. 



8 Discussion 

By establishing stability results for intrinsic and Ziezold means on the manifold 
part of a shape space, a gap in asymptotic theory for general non-manifold 
shape spaces could be closed, now allowing for multi-sample tests of equality of 
intrinsic means and Ziezold means. A similar stability assertion in general is false 
for Procrustes means. Note that the argument applied to intrinsic and Ziezold 
means fails for Procrustes means, since in contrast to ([3]) the sum of Procrustes 
residuals is in general non-zero. The findings on dimensionality condense to 

Procrustes means may decrease dimension, 
intrinsic and Ziezold means preserve dimension, 
Schoenberg means tend to increase dimension. 

Due to the proximity of Ziezold and intrinsic means on Kendall's shape 
spaces in most practical applications, taking into account that the former are 
computationally easier accessable (optimally positioning and Euclidean aver- 
aging in every iteration step) than intrinsic means (optimally positioning and 
weighted averaging in every iteration step), in practical applications Ziezold 
means, may be preferred over intrinsic means. They may be even more pre- 
ferred over intrinsic means, since Ziezold means naturally come with residual 
tangent space coordinates which may allow in case of intrinsic means for a 
slightly higher finite power of tests than intrinsic tangent space coordinates. 

Computationally much faster (not relying on iteration at all) are Schoenberg 
means which are available for Kendall's reflection shape spaces. As a drawback, 
however, Schoneberg means, seem less sensitive for dimensionality of configura- 
tions considered than intrinsic or Ziezold means. In consequence, multi-sample 
test based on Schoenberg means may have a considerably lower power than 
when based on intrinsic or Ziezold means. 

We note that Zie zold means may b e defin ed for the shape spaces of planar 
curv es introduced b y Zahn and Roskiesl (Il972), whi ch are currently very popular 



e.g. iKlassen et all (|2004l ) or lSchmidt et alfpOOd ). Employing Ziezold means 



there, a computational advantage greater than found here can be expected since 
computation of iterates of intrinsic means involve computations of geodesies 
which themselves can only be found iteratively. 



27 



Acknowledgment 



The author would like to thank Alexander Lytchak for helpful advice on dif- 
ferential geometic issues. Also, the author gratefully acknowledges support by 
DFG grant MU 1230/10-1 and GRK 1023. 

References 

Bandulasiri, A., Patrangenaru, V., 2005. Algorithms for nonparametric infer- 
ence on shape manifolds. 

Bhattacharya, A., 2008. Statistical analysis on manifolds: A nonparametric 
approach for inference on shape spaces. , to appear. 

Bhattacharya, R. N., Patrangenaru, V., 2003. Large sample theory of intrinsic 
and extrinsic sample means on manifolds I. Ann. Statist. 31 (1), 1-29. 

Bhattacharya, R. N., Patrangenaru, V., 2005. Large sample theory of intrinsic 
and extrinsic sample moans on manifolds II. Ann. Statist. 33 (3), 1225-1259. 

Blum, H., Nagel, R. N., 1978. Shape description using weighted symmetric axis 
features. Pattern Recognition 10 (3), 167-180. 

Brcdon, G. E., 1972. Introduction to Compact Transformation Groups. Vol. 46 
of Pure and Applied Mathematics. Academic Press. 

Choquet, G., 1954. Theory of capacities. Ann. Inst. Fourier 5, 131-295. 

Drydcn, I. L., Kumc, A., Le, H., Wood, A. T. A., 2008. A multidimensional 
scaling approach to shape analysis. To appear. 

Drydcn, I. L., Mardia, K. V., 1998. Statistical Shape Analysis. Wiley, Chich- 
ester. 

Frechet, M., 1948. Les elements aleatoires de nature quelconque dans un espace 
distancie. Ann. Inst. H. Poincare 10 (4), 215-310. 

Gower, J. C, 1975. Generalized Procrustes analysis. Psychometrika 40, 33-51. 

Hcndriks, H., 1991. A Cramer- Rao type lower bound for estimators with values 
in a manifold. Journal of Multivariate Analysis 38, 245-261. 

Hendriks, FL, Landsman, Z., 1996. Asymptotic behaviour of sample mean loca- 
tion for manifolds. Statistics & Probability Letters 26, 169-178. 

Hendriks, H., Landsman, Z., 1998. Mean location and sample mean location 
on manifolds: asymptotics, tests, confidence regions. Journal of Multivariate 
Analysis 67, 227-243. 



28 



Hondriks, H., Landsman, Z., Ruymgaart, F., 1996. Asymptotic behaviour of 
sample mean direction for spheres. Journal of Multivariate Analysis 59, 141- 
152. 

Hobolth, A., Kent, J., Dryden, I., 2002. On the relation between edge and 
vertex modelling in shape analysis. Scandinavian Journal of Statistics 29 (3), 
355-374. 

Hotz, T., Huckemann, S., Gaffrey, D., Munk, A., Sloboda, B., 2010. Shape 
spaces for pre-alingend star-shaped objects in studying the growth of plants. 
Journal of the Royal Statistical Society, Series C 59 (1), 127-143. 

Huckemann, S., Hotz, T., Munk, A., 2009. Intrinsic MANOVA for Ricman- 
nian manifolds with an application to Kendall's space of planar shapes. IEEE 
Transactions on Pattern Analysis and Machine Intelligence. To appear. 

Huckemann, S., Ziezold, H., 2006. Principal component analysis for Riemannian 
manifolds with an application to triangular shape spaces. Adv. Appl. Prob. 
(SGSA) 38 (2), 299-319. 

Jupp, P. E., 1988. Residuals for directional data. J. Appl. Statist. 15 (2), 137- 
147. 

Karcher, H., 1977. Riemannian center of mass and mollifier smoothing. Com- 
munications on Pure and Applied Mathematics XXX, 509-541. 

Kendall, D., 1974. Foundations of a theory of random sets. Stochastic Geom., 
Tribute Memory Rollo Davidson, 322-376 (1974). 

Kendall, D. G., 1984. Shape manifolds, Procrustean metrics and complex pro- 
jective spaces. Bull. Lond. Math. Soc. 16 (2), 81-121. 

Kendall, D. G., Barden, D., Carne, T. K., Le, H., 1999. Shape and Shape 
Theory. Wiley, Chichester. 

Kendall, W. S., 1990. Probability, convexity, and harmonic maps with small 
image I: Uniqueness and fine existence. Proc. London Math. Soc. 61, 371- 
406. 

Klasscn, E., Srivastava, A., Mio, W., Joshi, S., Mar. 2004. Analysis on planar 
shapes using geodesic paths on shape spaces. IEEE Transactions on Pattern 
Analysis and Machine Intelligence 26 (3), 372-383. 

Kobayashi, S., Nomizu, K., 1969. Foundations of Differential Geometry. Vol. II. 
Wiley, Chichester. 

Krim, H., Yezzi, A. J. J. E., 2006. Statistics and Analysis of Shapes. Modeling 
and Simulation in Science, Engineering and Technology. Birkhauser, Boston. 

Le, H., Jun. 2001. Locating Frechet means with an application to shape spaces. 
Adv. Appl. Prob. (SGSA) 33 (2), 324-338. 



29 



Le, H., 2004. Estimation of Riemannian barycenters. LMS Journal of Compu- 
tation and Mathematics 7, 193-200. 

Mardia, K., Patrangenaru, V., 2001. On affine and projective shape data anal- 
ysis. Functional and Spatial Data Analysis, Proceedings of the 20th LASR 
Workshop (Eds: K.V. Mardia and R.G. Aykroyd), 39-45. 

Mardia, K., Patrangenaru, V., 2005. Directions and projective shapes. Ann. 
Statist. 33, 1666-1699. 

Matheron, G., 1975. Random sets and integral geometry. Wiley Series in Prob- 
ability and Mathematical Statistics. New York. 

Mosimann, J. E., 1970. Size allometry: Size and shape variables with charac- 
terizations of the lognormal and generalized gamma distributions. Journal of 
the American Statistical Association 65 (330), 930-945. 

Munk, A., Paige, R., Pang, J., Patrangenaru, V., Ruymgaart, F., 2008. The one- 
and multi-sample problem for functional data with application to projective 
shape analysis. Journal of Multivariate Analysis (99), 815-833. 

Nash, J., 1956. The imbedding problem for Riemannian manifolds. Ann. of 
Math. 63, 20-63. 

Palais, R. S., 1960. Slices and equivariant embeddings. In: Borel, A. (Ed.), 
Seminar on Transformation Groups. No. 46 in Annals of Mathematics Studies. 
Princeton Univ. Press, Princton NJ, Ch. 8, pp. 101-115. 

Schmidt, F. R., Clausen, M., Cremers, D., 2006. Shape matching by varia- 
tional computation of geodesies on a manifold. In: Pattern Recognition (Proc. 
DAGM). Vol. 4174 of LNCS. Springer, Berlin, Germany, pp. 142-151. 

Small, C. G., 1996. The Statistical Theory of Shape. Springer- Verlag, New York. 

Zahn, C, Roskies, R., 1972. Fourier descriptors for plane closed curves. IEEE 
Trans. Computers C-21, 269-281. 

Ziczold, H., 1977. Expected figures and a strong law of large numbers for random 
elements in quasi-metric spaces. Trans. 7th Prague Conf. Inf. Theory, Stat. 
Dec. Func., Random Processes A, 591-602. 

Ziezold, H., 1994. Mean figures and mean shapes applied to biological figure and 
shape distributions in the plane. Biom. J. (36), 491-510. 



30 



